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(54)11116: HIV ENVELOPE POLYPEFITDES 

(57) Abstract 

A method for the rational design 
and prepaiatioo of vaccines based on HIV 
envelope polypeptides is described. In 
one embodiment, the method for mak- 
ing an HIV gpl20 subunit vaccine for a 
geographic region comprises determining 
neutralizing epitopes in the V2 and/or C4 
domains of gpl20 of HIV as depicted in 
the figure. In a preferred embodiment of 
the method, neutralizing epitopes for the 
V2, V3 and C4 donains of gpl20 arc 
determined. Also described are DNA se- 
quences encoding gpl20 from preferred 
vaccine strains of HIV. 
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HIV ENVELOPE POLYPEPTIDES 

FIELD OF THS lyVgNTXON 
5 This invention relates to the rational design and 

preparation of HIV vaccines based on HIV envelope 
polypeptides and the resultant vaccines. This 
invention further relates to improved methods for HIV 
serotyping and iiaimunogens which induce antibodies 
10 useful in the serotyping methods. 

BACKGROUND OF THE INVENTION 

Acquired immunodeficiency syndrome (AIDS) is 
caused by a retrovirus identified as the human 
15 immunodeficiency virus (HIV) . There have been intense 
effort to develop a vaccine. These efforts have 
focused on inducing antibodies to the HIV envelope 
protein. Recent efforts have used subunit vaccines 
where an HIV protein, rather than attenuated or killed 
20 virus, is used as the iramunogen in the vaccine for 
safety reasons. Subunit vaccines generally include 
gpl20, the portion of the HIV envelope protein which is 
on the surface of the virus. 

The HIV envelope protein has been extensively 
25 described, and the amino acid and RNA sequences 

encoding HIV envelope from a number of HIV strains are 
known (Myers, G. et al,, 1992. Human Retroviruses and 
AIDS. A compilation and analysis of nucleic acid and 
amino acid sequences. Los Alamos National Laboratory, 
3 0 Los Alamos, New Mexico) . The HIV envelope protein is a 
glycoprotein of about 160 kd (gpl60) which is anchored 
in the membrane bilayer at its carboxyl terminal 
region. The N-terminal segment, gpl20, protrudes into 
the aqueous environment surrounding the virion and the 
35 C-terminal segment, gp41, spans the membrane. Via a 
host-cell mediated process, gpl60 is cleaved to form 
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gpl20 and the integral membrane protein gp41. As there 
is no covalent attachment between gpl20 and gp41, free 
gpl20 is released from the surface of virions and 
infected cells. 
5 The gpl20 molecule consists of a polypeptide core 

of 60,000 daltons which is extensively modified by 
N-linked glycosylation to increase the apparent 
molecular weight of the molecule to 120,000 daltons. 
The amino acid sequence of gpl20 contains five 

10 relatively conserved domains interspersed with five 
hypervariable domains. The positions of the 18 
cysteine residues in the gpl20 primary sequence, and 
the positions of 13 of the approximately 24 N-linked 
glycosylation sites in the gpl20 sequence are common to 

15 all gpl20 sequences. The hypervariable domains contain 
extensive amino acid substitutions, insertions and 
deletions. Sequence variations in these domains result 
in up to 30% overall sequence variability between gpl20 
molecules from the various viral isolates. Despite 

20 this variation, all gpl20 sequences preserve the 

virus's ability to bind to the viral receptor CD4 and 
to interact with gp41 to induce fusion of the viral and 
host cell membranes. 

gpl20 has been the object of intensive 

25 investigation as a vaccine candidate for subunit 

vaccines, as the viral protein which is most likely to 
be accessible to immune attack. gpl20 is considered to 
be a good candidate for a subunit vaccine, because 
(i) gpl20 is known to possess the CD4 binding domain by 

30 which HIV attaches to its target cells, (ii) HIV 

infectivity can be neutralized in vitro by antibodies 
to gp 120, (iii) the majority of the in vitro 
neutralizing activity present in the serum of HIV 
infected individuals can be removed with a gpl20 

35 affinity column, and (iv) the gpl20/gp41 complex 
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appears to be essential for the transmission of HIV by 
cell-to-cell fusion. 

The identification of epitopes recognized by vinis 
neutralizing antibodies is critical for the rational 
5 design of vaccines effective against HIV^l infection. 
One way in which antibodies would be expected to 
neutralize HIV-l infection is by blocking the binding 
of the HIV-l envelope glycoprotein, gpl20, to its 
cellular receptor, CD4. However, it has been 

10 surprising that the CD4 blocking activity, readily 

demonstrated in sera from HIV-l infected individuals 
(31, 44) and animals immunized with recombinant 
envelope glycoproteins (1-3), has not always correlated 
with neutralizing activity (2, 31, 44). Results 

15 obtained with monoclonal antibodies have shown that 

while some of the monoclonal antibodies that block the 
binding of gpl20 to CD4 possess neutralizing activity, 
others do not (4, 7, 16, 26, 33, 35, 43, 45). When the 
neutralizing activity of CD4 blocking monoclonal 

20 antibodies are compared to those directed to the 

principal neutralizing determinant (PND) located in the 
third variable domain (V3 domain) of gpl20 (10, 39), 
the CD4 blocking antibodies appear to be significantly 
less potent. Thus, CD4 blocking monoclonal antibodies 

25 typically exhibit 50% inhibitory concentration values 
(ICjo) in the 1-10 ng/ml range (4, 16, 26, 33, 35, 43, 
45) whereas PND directed monoclonal antibodies 
typically exhibit IC50 values in the 0.1 to 1.0 fig /ml 
range (23, 33, 42) . 

30 Subunit vaccines, based on gpl20 or another viral 

protein, that can effectively induce antibodies that 
neutralize HIV are still being sought. However, to 
date no vaccine has not been effective in conferring 
' protection against HIV infection. 
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pESCRTPTTQN O F THE BACKGROUKD ART 

Recombinant subunit vaccines are described in 
Berman et al., PCT/US91/02250 (published as nxunber 
W091/15238 on 17 October 1991). See also, e.g. Hu et 
5 al., Nature 328:721-724 (1987) (vaccinia virus-HIV 

envelope recombinant vaccine); Arthur et al., J. Virol, 
63(12): 5046-5053 (1989) (purified gpl20) ; and Berman 
et al., Proc. Natl. Acad. Sci. USA 85:5200-5204 (1988) 
(recombinant envelope glycoprotein gpl20) . 

10 Numerous sequences for gpl20 are known. The 

sequence of gpl20 from the IIIB substrain of HIV-1lai 
referred to herein is that determined by Muesing et 
al., "Nucleic acid structure and expression of the 
human AIDS/lymphadenopathy retrovirus. Nature 

15 313:450-458 (1985). The sequences of gpl20 from the 
NY-5, Jrcsf, Z6, Z321, and HXB2 strains of HIV-1 are 
listed by Myers et al., "Human Retroviruses and AIDS; A 
compilation and analysis of nucleic acid and amino acid 
sequences," Los Alamos National Laboratory, Los Alamos, 

20 New Mexico (1992). The sequence of the Thai isolate 

A244 is provided by McCutchan et al., "Genetic Variants 
of HIV-l in Thailand," AIDS Res. and Human Retroviruses 
8:1887-1895 (1992). The MNj984 Clone is described by 
Gurgo et al. , "Envelope sequences of two new United 

25 States HIV-1 isolates," Virol. X64: 531-536 (1988). 
The amino acid sequence of this MN clone differs by 
approximately 2% from the MN-gpl20 clone (MNq^e) 
disclosed herein and obtained by Berman et al. 
Each of the above-described references is 

30 incorporated herein by reference in its entirety. 

SUMMARY OF THE INVENTION 

The present invention provides a method for the 
rational design and preparation of vaccines based on 
35 HIV envelope polypeptides. This invention is based on 
the discovery that there are neutralizing epitopes in 
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the V2 and C4 domains of gpl20, in addition to the 
neutralizing epitopes in the V3 domain. In addition, 
the amount of variation of the neutralizing epitopes is 
highly constrained, facilitating the design of an HIV 
5 subunit vaccine that can induce antibodies that 
neutralize a plurality of HIV strains for a given 
geographic region. 

In one embodiment , ' the present invention provides 
a method for making an HIV gpl20 subunit vaccine for a 

10 geographic region in which a neutralizing epitope in 

the V2 and/or C4 domains of gpl20 of HIV isolates from 
the geographic region is determined and an HIV strain 
having gpl20 which has a neutralizing epitope in the V2 
or C4 domain which is common among isolates in the 

15 geographic region is selected and used to make the 
vaccine. 

In a preferred embodiment of the method, 
neutralizing epitopes for the V2, V3, and 04 domains of 
gpl20 from HIV isolates from the geographic region are 

20 determined. At least two HIV isolates having different 
neutralizing epitopes in the V2, V3, or C4 domain are 
selected and used to make the HIV gpl20 subunit 
vaccine. Preferably, each of the selected isolates 
have one of the most common neutralizing epitopes for 

25 the V2, V3, or C4 domains. 

The invention also provides a multivalent HIV 
gpl20 subunit vaccine. The vaccine comprises gpl20 
from two isolates of HIV having at least one different 
neutralizing epitope. Preferably, the isolates have 

3 0 the most common neutralizing epitopes in the geographic 
region for one of the domains. 

A DNA sequence of less than 5 kilobases encoding 
gpl20 from preferred vaccine strains of HIV, GNEg and 
GNEifi, expression construct comprising the GNEj-gpl20 and 

35 GNEi5-gpl20 encoding DNA under the transcriptional and 
translational control of a heterologous promoter, and 
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isolated GNE8-gpl20 and GNEi^-gpiao are also provided. 
The invention ftirther provid s improved methods f r HIV 
serotyping in which epitopes in the V2 or C4 domains of 
gpl20 are determined and provides immunogens (triincated 
5 gpl20 sequences) which induce antibodies useful in the 
serotyping methods. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 describes inhibition of CD4 binding by 

10 monoclonal antibodies to recombinantly produced gpl20 
from the MN strain of HIV (MN-rgpl20) . Mice were 
immunized with MN-rgpl20 and the resulting splenocytes 
were fused with the NP3X63 . Ag8 . 653 cell line as 
described in Example 1. Thirty-five stable hybridoma 

15 clones, reactive with MN-rgpl20 were identified by 

ELISA. Secondary screening revealed seven cell lines 
(1024, 1093, 1096, 1097, 1110, 1112, and 1027) 
secreting antibodies able to inhibit the binding of 
MN-rgpl20 to biotin labeled recombinantly produced CD4 

20 (rsCD4) in a ELISA using HRPO-strepavadin. Data 
obtained with monoclonal antibodies from the same 
fusion (1026, 1092, 1126) that failed to inhibit 
MN-rgpl20 binding to CD4 is shown for purposes of 
comparison. 

25 FIGURE 2 shows neutralizing activity of CD4- 

blocking monoclonal antibodies to MN-rgpl20, 
Monoclonal antibodies that blocked the binding of 
MN-rgpl20 to CD4 were screened for the capacity to 
inhibit the infection of MT2 cells by the MN strain of 

30 HIV-1 in vitro. Cell free virus was added to wells 

containing serially diluted antibodies and incxxbated at 
4**C for 1 hr. After incubation, MT-2 cells were added 
to the wells and the cultures were then grown for 5 
days at 37«C. Cell viability was then measured by 

35 addition of the colorimetric tetrazolium compound MTT 
as described in reference (35) of Example 1. The 
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optical densities f each well were measured at 540 nm 
using a microtit r plat reading spectrophotometer. 
Inhibition of virus infectivity was calculated by 
dividing the mean optical densities from wells 
containing monoclonal cuitibodies by the mean value of 
wells that received virus alone. Monoclonal antibodies 
that blocked CD4 binding are the same as those 
indicated in Figure Legend 1. Data from the V3- 
directed monoclonal antibody to MN-rgpl20 (1034) is 
provided as a positive control. Data obtained with the 
V3 directed monoclonal antibody, 11G5/ specific for the 
IIIB strain of HIV-1 (33) is shown as a negative 
control. 

FIGURE 3 is a diagram of gpl20 fragments used to 
localize the epitopes recognized by the CD4 blocking 
monoclonal antibodies to MN-rgpl20. A series of 
fragments (A) corresponding to the V4 and C4 domains 
(B) (SEQ, ID. NO. 14) of the gene encoding MN-rgpl20 
were prepared by PCR. The gpl20 gene fragments were 
fused to a fragment of the gene encoding Herpes Simplex 
Virus Type 1 glycoprotein D that encoded the signal 
sequence and 25 amino acids from the mature amino 
terminus. The chimeric genes were assembled into a 
mammalian cell expression vector {PRK5) that provided a 
CMV promoter, translational stop codons and an SV40 
polyadenylation site. The embryonic human kidney 
adenocarcinoma cell line, 293s, was transfected with 
the resulting plasmid and recombinant proteins were 
recovered from growth conditioned cell culture medium. 
Fragments of MN-rgpl20, expressed as HSV-i Gd fusion 
proteins, were produced by transient transfection of 
293s cells (Example 1) . To verify expression, cells 
were metabolically labeled with [^^S] -methionine, and 
the resulting growth conditioned cell cultiire 
supernatants were immunoprecipitated (C) using a 
monoclonal antibody, 5B6, specific for the aunino 
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terminus of HSV-1 Gd and fixed S. aureus. The 
immunoprecipitated proteins were resolved on 4 to 20 % 
acrylamide gradient gels using SDS-PAGE and visualized 
by autoradiography. The samples were: Lane 1, 
5 FMN. 368-408; lane 2, FMN. 368-451; lane 3, FMN. 419-443; 
lane 4, FMN. 414-451; lane 5, MN-rgpl20, The gel 
demonstrated that the proteins were expressed and 
migrated at the expected molecular weights. 

FIGURE 4 shows a 04 domain sequence comparison 

10 (SEQ. ID, Nos. 3-13), The 04 domain amino acid 

sequences of recombinant and virus derived gpl20s used 
for monoclonal antibody binding studies were aligned 
starting the amino terminal cysteine* Amino acid 
positions are designated with respect to the sequence 

15 of MN-rgpl20. Sequences of the LAI substrains, IIIB, 
BHIO, Bru, HXB2, and HXB3 are shown for purposes of 
comparison. 

FIGURE 5 shows sequences of 04 domain mutants of 
MN-rgpl20 (SEQ. ID. Nos. 3 and 15-23). Nucleotide 

20 substitutions, resulting in the amino acid sequences 
indicated, were introduced into the 04 domain of 
MN-rgpl20 gene using recombinant FOR. The resulting 
variants were assembled into the expression plasmid, 
pRK5, which was then transfected into 293s cells. The 

25 binding of monoclonal antibodies to the resulting 04 
domain variants was then analyzed (Table 5) by ELISA. 

FIGURE 6 illustrates the reactivity of monoclonal 
antibody 1024 with HIV-Il^i substrains. The cell 
surface binding of the 04 domain reactive monoclonal 

30 antibody 1024 to H9 cells chronically infected with the 
IIIB, HXB2, HXB3, and HXBIO sxibstrains of HIV-1 LAI or 
HIV-lMN was analyzed by flow cytometry. Oultxires of 
virus infected cells were reacted with either 
monoclonal antibody 1024, a nonrelevant monoclonal 

35 antibody (control) , or a broadly cross reactive 
monoclonal antibody (1026) raised against rgpl20. 
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After washing away unbound monoclonal antibody, the 
cells were then labeled with fluorescein conjugated 
goat antibody t mouse IgG (Fab*),, washed and fixed 
with paraformaldehyde. The resulting cells were 
analyzed for degree of fluorescence intensity using a 
FACSCAN (Becton Dickenson, Fullerton, CA) . 
Fluorescence was measured as mean intensity of the 
cells expressed as mean channel number plotted on a log 
scale. 

FIGURE 7 shows the determination of the binding 
affinity of monoclonal antibodies for MN-rgpl20. CD4 
blocking monoclonal antibodies raised against MN-rgpl20 
(1024 and 1097) or IIIB-rgpl20 {13H8 and 5C2) were • 
labeled with and binding titrations using 

MN-rgpl20 (A and B) or IIIB-rgpl20 (C and D) were 
carried out as described in the Example 1. A, binding 
of monoclonal antibody 1024; B binding of monoclonal 
antibody 1097; C, binding of monoclonal antibody 13H8; 
and D binding of monoclonal antibody 5C2. 

FIGURE 8 shows the correlation between gpl20 
binding affinity (K^) and neutralizing activity (IC50) 
of monoclonal antibodies to the C4 domain of MN-rgpl20, 
Binding affinities of monoclonal antibodies to the C4 
domain of gpl20 were determined by Scatchard analysis 
(Figure 9, Table 5). The resulting values were plotted 
as a function of the log of their neutralizing 
activities (IC50) determined in Figiire 2 and Table 6. 

FIGURE 9 depicts the amino acid sequence of the 
mature envelope glycoprotein (gpl20) from the KNam 
clone of the MN strain of HIV-1 (SEQ. ID. NO. 1) . 
Hypervariable domains are from 1-29 (signal sequence) , 
131-156, 166-200,305-332, 399-413, and 460-469. The V 
and C regions are indicated (according to Modrow et 
al., J. l^irology 61(2) :570 (1987). Potential 
glycosylation sites are marked with a (*) . 



FIGURE 10 depicts the amino acid sequence of a 
fusi n protein of the residues 41-511 of the mature 
envelope glycoprotein (gpl20) from the MNcn^ clone of 
the MN strain of HIV-1, and the gD-1 amino terminus 
from the herpes simplex glycoprotein gD-1- (SEQ. ID. 
NO. 2) • The V and C regions are indicated (according 
to Modrow et al . , J- Virology 61(2) :570 (1987) • 
Potential glycosylation sites are marked with a (*). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a method for the 
rational design and preparation of vaccines based on 
HIV envelope polypeptides. This invention is based on 
the discovery that there are neutralizing epitopes in 
the V2 and C4 domains of gpl20, in addition to the 
neutralizing epitopes in the V3 domain. Although the 
amino acid sequences of the neutralizing epitopes in 
the V2, V3, and C4 domains are variable, it has now 
been fo\ind that the amount of variation is highly 
constrained. The limited amount of variation 
facilitates the design of an HIV subunit vaccine that 
can induce antibodies that neutralize the most common 
HIV strains for a given geographic region. In 
particular, the amino acid sequence of neutralizing 
epitopes in the V2, V3, and C4 domains for isolates of 
a selected geographic region is determined. gpl20 from 
isolates having the most common neutralizing epitope 
sec[uences are utilized in the vaccine. 

The invention also provides a multivalent gpl20 
subunit vaccine wherein gpl20 present in the vaccine is 
from at least two HIV isolates which have different 
amino acid sequences for a neutralizing epitope in the 
V2, V3, or C4 domain of gpl20. The invention further 
provides improved methods for HIV serotyping in which 
epitopes in the V2 or C4 domains of gpl20 are 
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determined and provides imnunogens which induce 
antibodies useful in the serotyping methods. 

The term "subunit vaccine" is used herein, as in 
the art, to refer to a viral vaccine that does not 
contain virus, but rather contains one or more viral 
proteins or fragments of viral proteins. As used 
herein, the term "multivalent" means that the vaccine 
contains gpl20 from at least two HIV isolates having 
different amino acid sequences for a neutralizing 
epitope. 

Vaccine Design Method 

The vaccine design method of this invention is 
based on the discovery that there are neutralizing 
epitopes in the V2 and C4 domains of gpl20, in addition 
to those found in the principal neutralizing domain 
(PND) in the V3 domain. Selecting an HIV isolate with 
appropriate neutralizing epitopes in the V2 and/or C4 
domains provides a vaccine that is designed to induce 
immunity to the HIV isolates present in a selected 
geographic region. In addition, although the amino 
acid sequence of the V2, V3, and C4 domains containing 
the neutralizing epitopes is variable, the amovmt of 
variation is highly constrained, facilitating the 
design of a multivalent vaccine which can neutralize a 
plurality of the most common HIV strains for a given 
geographic region. 

The method for making an HIV gpl20 subunit vaccine 
depends on the use of appropriate strains of HIV for a 
selected geographic region. Appropriate strains of HIV 
for the region are selected by determining the 
neutralizing epitopes for HIV isolates and the 
percentage of HIV infections attributable to each 
strain present in the region. HIV strains which have 
the most common neutralizing epitopes in the V2 or C4 
domains in the geographic region are selected. ^ 
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Preferably, isolates that confer protection against the 
most c romon neutralizing epitopes in the V2, V3, and C4 
domains for a geographic region are selected. 

One extibodiment of the method for making an HIV 
5 gpl20 subunit vaccine from appropriate strains of HIV 
for a geographic region comprises the following steps* 
A neutralizing epitope in the V2 or C4 domain of gpl20 
of HIV isolates from the geographic region is 
determined. An HIV strain having gpl20 with a 

10 neutralizing epitope in the V2 or C4 domain that is 

common among HIV isolates in the geographic region is 
selected. gpl20 from the selected isolate is used to 
make an HIV gpl20 subunit vaccine. 

In another embodiment of the method, the 

15 neutralizing epitopes in the V2, V3, and C4 domains of 
gpl20 from HIV isolates from the geographic region are 
determined. At least two HIV isolates having different 
neutralizing epitopes in the V2, V3, or C4 domain are 
selected and used to make an HIV gpl20 subunit vaccine. 

20 Preferably, the vaccine contains gpl20 from at least 
the two or three HIV strains having the most common 
neutralizing epitopes for the V2, V3, or C4 domains. 
More preferably, the vaccine contains gpl20 from 
sufficient strains so that at least about 50%, 

25 preferably about 70%, more preferably about 80% or more 
of the neutralizing epitopes for the V2, V3, and C4 
domains in the geographic region are included in the 
vaccine. The location of the neutralizing epitopes in 
the V3 region are well known. The location of the 

30 neutralizing epitopes in the V2 and C4 regions are 
described hereinafter. 

Each of the steps of the method are described in 
detail below. 

35 Determining neutra lizing epitopes 
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The first step in designing a vaccine for a 
selected geographic regi n is to determine the 
neutralizing epitopes in the gpl20 V2 and/or C4 
domains. In a preferred embodiment, neutralizing 
5 epitopes in the V3 domain (the principal neutralizing 
domain) are also determined. The location of 
neutralizing epitopes in the V3 domain is well known. 
Neutralizing epitopes in the V2 and C4 domains have now 
been found to be located between about residues 163 and 

10 200 and between about residues 420 and 440, 

respectively. In addition, the critical residues for 
antibody binding are residues 171, 173, 174, 177, 181, 
183, 187, and 188 in the V2 domain and residues 429 and 
432 in the C4 domain, as described in detail in the 

15 Examples * 

The neutralizing epitopes for any isolate can be 
determined by sequencing the region of gpl20 containing 
the neutralizing epitope. Alternatively, when 
antibodies specific for the neutralizing epitope, 

20 preferably monoclonal antibodies, are availeJDle the 

neutralizing epitope can be determined by serological 
methods as described hereinafter. A method for 
identification of additional neutralizing epitopes in 
gpl20 is described hereinafter. 

25 When discussing the amino acid sequences of 

various isolates and strains of HIV, the most common 
numbering system refers to the location of amino acids 
within the gpl20 protein using the initiator methionine 
residue as position 1. The amino acid nvimbering 

30 reflects the mature HIV-1 gpl20 amino acid sequence as 
shown by Figures 9 and Fig. 10 [SEQ. ID Nos. 1 and 2]. 
For gpl20 sequences derived from other HIV isolates and 
which include their native HIV N-terminal signal 
sequence, numbering may differ. Although the 

35 nucleotide and amino acid residue niimbers may not be 

applicable in other strains where upstream deletions or 
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insertions change the length of the viral genome cind 
gpl20, the regi n encoding the portions of gpl20 is 
readily identified by reference to the teachings 
herein. The variable (V) domains and conserved (C) 
S domains of gpl20 are specified according to the 
nomenclature of Modrow et al. "Computer-assisted 
analysis of envelope protein sequences of seven human 
immunodeficiency virus isolates: predictions of 
antigenic epitopes in conserved and variable regions," 

10 J. Virol. 61:570-578 (1987). 

The first step in identifying the neutralizing 
epitopes for any region of gpl20 is to immunize an 
animal with gpl20 to induce anti-gpl20 antibodies. The 
antibodies can be polyclonal or, preferably, 

15 monoclonal. Polyclonal antibodies can be induced by 
administering to the host animal an immunogenic 
composition comprising gpl20. Preparation of 
immunogenic compositions of a protein may vary 
depending on the host animal and the protein and is 

20 well known. For example, gpl20 or an antigenic portion 
thereof can be conjugated to an immunogenic substance 
such as KLH or BSA or provided in an adjuvant or the 
like. The induced antibodies can be tested to 
determine whether the composition is specific for 

25 gpl20. If a polyclonal antibody composition does not 
provide the desired specificity, the antibodies can be 
fractionated by ion exchange chromatography and 
immunoaf f inity methods using intact gpl2 0 or various 
fragments of gpl20 to enhance specificity by a variety 

30 of conventional methods. For example, the composition 
can be fractionated to reduce binding to other 
substances by contacting the composition with gpl20 
affixed to a solid substrate. Those antibodies which 
bind to the substrate are retained. Fractionation 

35 techniques using antigens affixed to a variety of solid 
substrates such as affinity chromatography materials 
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including Sephadex, Sepharose and the like are well 
known. 

Monoclonal anti-gpl20 antibodies can be produced 
by a number of conventional methods. A mouse can be 
5 injected with an immunogenic composition containing 
gpl20 and spleen cells obtained. Those spleen cells 
can be fused with a fusion partner to prepare 
hybridomas. Antibodies secreted by the hybridomas can 
be screened to select a hybridoma wherein the 
10 antibodies neutralize HIV infectivity, as described 

hereinafter. Hybridomas that produce antibodies of the 
desired specificity are cultured by standard 
technic[ues • 

Infected human lymphocytes can be used to prepare 

15 human hybridomas by a number of techniques such as 

fusion with a murine fusion partner or transformation 
with EBV. In addition, combinatorial libraries of 
hiiman or mouse spleen can be expressed in £. coli to 
produce the antibodies. Kits for preparing 

20 combinatorial libraries are commercially available. 
Hybridoma preparation techniques and culture methods 
are well known and constitute no part of the present 
invention. Exemplary preparations of monoclonal 
antibodies are described in the Examples. 

25 Following preparation of anti-gpl20 monoclonal 

antibodies, the antibodies are screened to determine 
those antibodies which are neutralizing antibodies. 
Assays to determine whether a monoclonal antibody 
neutralizes HIV infectivity are well known and are 

30 described in the literature. Briefly, dilutions of 

antibody and HIV stock are combined and incubated for a 
time sufficient for antibody binding to the virus. 
Thereafter, cells that are susceptible to HIV infection 
are combined with the virus /antibody mixture and 

35 cultured. MT-2 cells or H9 cells are susceptible to 
infection by most HIV strains that are adapted for 
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growth in the laboratory. Activated peripheral blood 
mononuclear cells (PBMCs) or macrophages can be 
infected with primary isolates (isolates from a patient 
specimens which have not been cultured in T-cell lines 
or transformed cell lines). Daar et al, Proc. Natl. 
Acad, Sci. USA 87:6574-6578 (1990) describe methods for 
infecting cells with primary isolates. 

After culturing the cells for about five days, the 
number of viable cells is determined, as by measuring 
metabolic conversion of the formazan MTT dye. The 
percentage of inhibition of infectivity is calculated 
to determine those antibodies that neutralize HIV. An 
exemplary preferred procedure for determining HIV 
neutralization is described in the Examples. 

Those monoclonal antibodies which neutralize HIV 
are used to map the epitopes to which the antibodies 
bind. To determine the location of a gpl20 
neutralizing epitope, neutralizing antibodies are 
combined with fragments of gpl20 to determine the 
fragments to which the antibodies bind. The gpl20 
fragments used to localize the neutralizing epitopes 
are preferably made by recombinant DNA methods as 
described hereinafter and exemplified in the Examples. 
By using a plurality of fragments, each encompassing 
different, overlapping portions of gpl20, an amino acid 
sequence encompassing a neutralizing epitope to which a 
neutralizing antibody binds can be determined. A 
preferred exemplary determination of the neutralizing 
epitopes to which a series of neutralizing antibodies 
binds is described in detail in the Examples. 

This use of overlapping fragments can narrow the 
location of the epitope to a region of about 20 to 40 
residues. To confirm the location of the epitope and 
narrow the location to a region of about 5 to 10 
residues, site-directed mutagenicity studies are 
preferably performed. Such studies can also determine 
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the critical residues for binding of neutralizing 
antibodies. A preferred exemplary site-directed 
mutagenicity procedure is described in the Examples. 

To perform site-directed mutagenicity studies, 
recombinant PGR techniques can be utilized to introduce 
single amino acid substitutions at selected sites into 
gpl20 fragments containing the neutralizing epitope. 
Briefly, overlapping portions of the region containing 
the epitope are amplified using primers that 
incorporate the desired nucleotide changes. The 
resultant PGR products are annealed and amplified to 
generate the final product. The final product is then 
expressed to produce a mutagenized gpl20 fragment. 
Expression of DNA encoding gpl20 or a portion thereof 
is described hereinafter and exemplified in the 
Examples. 

In a preferred embodiment described in Example 1, 
the gpl20 fragments are expressed in mammalian cells 
that are capable of expression of gpl20 fragments 
having the same glycolsylation and disulfide bonds as 
native gpl20. The presence of proper glycolsylation 
and disulfide bonds provides fragments that are more 
likely to preserve the neutralizing epitopes than 
fragments that are expressed in E. coli, for example, 
which lack disulfide bonds and glycosylation or are 
chemically synthesized which lack glycolsylation and 
may lack disulfide bonds. 

Those mutagenized gpl20 fragments are then used in 
an immunoassay using gpl20 as a control to determine 
the mutations that impair or eliminate binding of the 
neutralizing antibodies. Those critical auaino acid 
residues form part of the neutralizing epitope that can 
only be altered in limited ways without eliminating the 
epitope. Each alteration that preserves the epitope 
can be determined. Such mutagenicity studies 
demonstrate the variations in the amino acid sequence 
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of the neutralizing epitope that provide equivalent or 
diminished binding by neutralizing antibodies or 
eliminate antibody binding. Although the aunino acid 
sequence of gpl20 used in the vaccine preferably is 
5 identical to that of a selected HIV isolate for the 
given geographic region, alterations in the aunino acid 
sequence of neutralizing epitope that are suitable for 
use in a vaccine can be determined by such studies. 
Once a neutralizing epitope is localized to a 
10 region of ten to twenty amino acids of gpl20, the amino 
acid sequence of corresponding neutralizing epitopes of 
other HIV isolates can be determined by identifying the 
corresponding portion of the gpl20 amino acid sequence 
of the isolate. 

15 Once the neutralizing epitopes for a given region 

of gpl20 are determined, the amino acid sequence of HIV 
isolates for the geographic region are determined. The 
complete amino acid sequence for numerous isolates has 
been determined and is available from numerous journal 

20 articles and in databases. In such cases, 

determination of the amino acid sequence of HIV 
isolates for the geographic region involves looking up 
the sequence in an appropriate database or journal 
article. However, for some isolates, the amino acid 

25 sequence information does not include the. sequence of 
the V2 or C4 domains. 

When the amino acid sequence of a region of 
interest for a given isolate is not known, the amino 
acid sequence can be determined by well known methods. 

30 Methods for determining the amino acid sequence of a 
protein or peptide of interest are well known and are 
described in numerous references including Maniatis et 
al.. Molecular Cloning — A Laboratory Manual, Cold 
Spring Harbor Laboratory (1984). In addition, 

35 automated instruments which sequence proteins are 
commercially available. 
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Alternatively, the nucleotide sequence of DNA 
ncoding gpl20 r a relevant p rtion f gpl20 can be 
determined and the amino acid sequence of gpl20 can be 
deduced. Methods for amplifying gpl20-encoding DNA 
5 from HIV isolates to provide sufficient DNA for 

sequencing are well known. In particular^ Ou et al. 
Science 256:1165-1171 (1992); Zhang et al. AIDS 
5:675-681 (1991); and Wolinsky Science 255:1134-1137 
(1992) describe methods for amplifying gpl20 DNA. 

10 Sequencing of the amplified DNA is well known and is 
described in Maniatis et al., Molecular Cloning — A 
Laboratory Manual, Cold Spring Harbor Laboratory 
(1984), and Horvath et al., An Automated DNA 
Synthesizer Employing Deoxynucleoside 

15 3 '-Phosphoramidites, Methods in Enzymology 154: 

313-326, (1987), for example. In addition, automated 
instruments that sequence DNA are commercially 
available. 

20 In a preferred embodiment, the isolate is a 

patient isolate which has not been passaged in culture. 
It is known that following passage in T-cells, HIV 
isolates mutate and isolates best suited for growth 
under cell culture conditions are selected. For 

25 example, cell culture strains of HIV develop the 

ability to form syncytia. Therefore, preferably the 
amino acid sequence of gpl20 is determined from a 
patient isolate prior to growth in culture. Generally, 
DNA from the isolate is amplified to provide sufficient 

30 DNA for sequencing. The deduced amino acid sequence is 
used as the amino acid sequence of the isolate, as 
described hereinbefore. 

To determine the percentage each isolate 
constitutes of total HIV that infects individuals in 

35 the geographic region, standard epidemiological methods 
are used. In particular, sufficient isolates are 
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sequenced to ensure confidence that the percentage of 
each isolate in the g ographic region has been 
determined. For example, Ichimura et al, AIDS Res. 
Hum. Retroviruses 10:263-269 (1994) describe an 
5 epidemiological study in Thailand that determined that 
there are two strains of HIV present in the region. 
HIV strains have only recently been present in Thailand 
and Thailand, therefore has the most homogenous 
population of HIV isolates known to date. The study 

10 sequenced 23 isolates from various parts of the country 
and determined that only two different amino acid 
sequences were present in the isolates. 

In contrast, HIV has been infecting individuals in 
Africa for the longest period of any geographic region. 

15 In Africa, each of the most common isolates probably 

constitutes about 5% of the population. In such cases, 
more isolates would need to be sequenced to determine 
the percentage each isolate constitutes of the 
population. Population studies for determining the 

20 percentage of various strains of HIV, or other viruses, 
present in a geographic region are well known and are 
described in, for example, Ou et al. Lancet 341:1171- 
1174 (1993); Ou et al, AIDS Res. Hum. Retroviiruses 
8:1471-1472 (1992); and McCutchan et al., AIDS Res. 

25 Hum. Retroviruses 8:1887-1895 (1992). 

In the United States and western Europe, probably 
about two to four different neutralizing epitopes in 
each of the V2, V3, and C4 domains constitute 50 to 70% 
of the neutralizing epitopes for each domain in the 

30 geographic region, as described more fully hereinafter. 

Selection method 

Once the amino acid sequence of neutralizing 
epitopes for strains in a region are determined, gpl20 
35 from an HIV strain having gpl20 that has an amino acid 
sequence for a neutralizing epitope in the V2 or C4 
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domain which sequence is one of the most common in the 
g ographic region is selected « One of the most common 
neutralizing epitope amino acid sequences mecms that 
the strain has an amino acid sequence for at least one 
5 neutralizing epitope that is occurs among the most 
frequently for HIV isolates in the geographic region 
and thus is present as a significant percentage of the 
population. For example, if there are three sequences 
for a neutralizing epitope that constitute 20, 20, and 
10 40 percent of the sequences for that epitope in the 

region and the remainder of the population is comprised 
by 2 to 4 other sequences, the three sequences are the 
most common. Therefore, in African countries, if each 
of several amino acid sequences constitute about 5% of 
15 the sequences for a neutralizing epitope and the 

remainder of the sequences each constitute less than 1% 
of the population, the isolates that constitute 5% of 
the population are the most common. 

Preferably, isolates having the most common amino 
20 acid sequences for a neutralizing epitope are chosen. 
By the most common is meant that the sequences occur 
most frequently in the geographic region. For example, 
in the United States, the MN isolate has a C4 
neutralizing epitope that comprises at least about 45% 
25 of the population. The GNEg isolate has a 04 

neutralizing epitope that comprises at least about 45% 
of the population. Thus either isolate has the most 
common C4 neutralizing epitope in the region. When 
gpl20 from each isolate is combined in a vaccine, 
30 greater than about 90% of the 04 neutralizing epitope 

sequences are present in the vaccine. In addition, the 
amino acid sequences for the V3 neutralizing epitope in 
the MN and GNEj isolates are substantially similar and 
^ comprise about 60% of the population. Therefore, those 

35 strains have the two most common neutralizing epitopes 
for the V3 domain. In the V2 region, the MN isolate 
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amino acid sequences comprises about 10% of the 
population, and the GNEg isolate amino acid sequences 
comprises about 60% of the population. Therefore, the 
GNEs strain has the most common neutralizing epitope for 
5 the region and the two strains together comprise the 
two most common neutralizing epitopes for the region. 
A multivalent gpl2Q subunit vaccine containing the two 
isolates contains amino acid sequences for epitopes 
that constitute about 70% of the V2 domain, about 60% 

10 of the V3 domain, and about 90% of the C4 domain for 
the United States. 

In a preferred embodiment of the method, one or 
more HIV isolates having an amino acid sequence for a 
neutralizing epitope in the V2 and/or C4 domains that 

15 constitute at least about 50% of the population for a 
selected geographic region are selected. In a more 
preferred embodiment, isolates having the most common 
neutralizing epitopes in the V3 domain are also 
included in the vaccine. 

20 As is clear, once the most common amino acid 

sequences for the neutralizing epitopes in the V2, V3, 
and C4 domains are known, an isolate having a common 
epitope for each region is preferably selected. That 
is, when only two or three isolates are used for the 

25 vaccine, it is preferable to select the isolate for 

common epitopes in each region, rather than selecting 
an isolate by analysis of a single region. 

In a more preferred embodiment, gpl20 from 
isolates having epitopes that constitute at least 50% 

30 of the population for the geographic region for V2, V3, 
and C4 domains are present in the vaccine. More 
preferably, the isolates have epitopes that constitute 
at least 60% of the population for the geographic 
region for the three domains. Most preferably, 70% or v 

35 more are included. 
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In another preferred eiobodlment, the entire amino 
acid sequence of the V2 and C4 domains is determined in 
the selection process. In addition to selecting common 
sec[uences for the neutralizing epitopes, isolates 
having unusual polymorphisms elsewhere in the region 
are preferably not used for the vaccine isolates. 

Vaccine nreoaration 

gpl20 from the selected HIV isolate (s) is used to 
make a subunit vaccine, preferably a multivalent 
subunit vaccine. Preparation of gpl20 for use in a 
vaccine is well known and is described hereinafter. 
With the exception of the use of the selected HIV 
isolate, the gpl20 subunit vaccine prepared in the 
method does not differ from gpl20 subunit vaccines of 
the prior art. 

As with prior art gpl20 subunit vaccines, gpl20 at 
the desired degree of purity and at a sufficient 
concentration to induce antibody formation is mixed 
with a physiologically acceptable carrier. A 
physiologically acceptable carrier is nontoxic to a 
recipient at the dosage and concentration employed in 
the vaccine. Generally, the vaccine is formulated for 
injection, usually intramuscular or subcutaneous 
injection. Suitable carriers for injection include 
sterile water, but preferably are physiologic salt 
solutions, such as normal saline or buffered salt 
solutions such as phosphate buffered saline or ringer *s 
lactate. The vaccine generally contains an adjuvant. 
Useful adjuvants include QS21 which stimulates 
cytotoxic T-cells and alum (aluminum hydroxide 
adjuvant) . Formulations with different adjuvemts which 
enhance cellular or local immunity can also be used. 

Addition excipients that can be present in the 
vaccine include low molecular weight polypeptides (less 
than about 10 residues) , proteins, amino acids, 



carbohydrates including glucose or dextrans, chelating 
agents such as EOTA, and other excipients. 

The vaccine can also contain other HIV proteins. 
In particular, gp41 or the extracellular portion of 
gp41 can be present in the vaccine. Since 9p41 has a 
conserved amino acid sequence, the gp41 present in the 
vaccine can be from any HIV isolate. gpl60 from an 
isolate used in the vaccine can replace gpl20 in the 
vaccine or be used together with gpl2 0 from the 
isolate. Alternatively, gpl60 from an isolate having a 
different neutralizing epitope than those in the 
vaccine isolates can additionally be present in the 
vaccine . 

Vaccine formulations generally include a total of 
about 300 to 600 /xg of gpl20, conveniently in about 1.0 
ml of carrier. The amount of gpl20 for any isolate 
present in the vaccine will vary depending on the 
immunogenicity of the gpl20. For example, gpl20 from 
the Thai strains of HIV are much less immunogenic than 
gpl20 from the MN strain. If the two strains were to 
be used in combination, empirical titration of the 
amount of each virus would be performed to determine 
the percent of the gpl20 of each strain in the vaccine. 
For isolates having similar immunogenicity, 
approximately ec[ual amounts of each isolate 's gpl20 
would be present in the vaccine. For example, in a 
preferred embodiment, the vaccine includes gpl20 from 
the MN, GNEg, and GNEi^ strains at concentrations of 
about 300 P^^ strain in about 1.0 ml of carrier. 
Methods of determining the relative amount of an 
immunogenic protein in multivalent vaccines are well 
known and have been used, for example, to determine 
relative proportions of various isolates in multivalent 
polio vaccines. 

The vaccines of this invention are administered in 
the same manner as prior art HIV gpl20 subunit 
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vaccines. In particular, the vaccines are generally 
acJiainistered at 0^ 1, and at 6, 8 or 12 months, 
depending on the protocol. Following the immunization 
procedure, annual or bi-annual boosts can be 
5 administered. However, during the immunization process 
and thereafter, neutralizing antibody levels can be 
assayed and the protocol adjusted accordingly. 

The vaccine is administered to uninfected 
individuals. In addition, the vaccine can be 

10 administered to seropositive individuals to augment 
immune response to the virus, as with prior art HIV 
vaccines. It is also contemplated that DNA encoding 
the strains of gpl20 for the vaccine can be 
administered in a suitable vehicle for expression in 

15 the host. In this way, gpl20 can be produced in the 
infected host, eliminating the need for repeated 
immunizations. Preparation of gpl20 expression 
vehicles is described hereinafter. 

20 Production of crpl20 

gpl20 in the vaccine can be produced by any 
suitable means, as with prior art HIV gpl20 subunit 
vaccines. Recombinantly-produced or chemically 
synthesized gpl20 is preferable to gpl20 isolated 

25 directly from HIV for safety reasons. Methods for 
recombinant production of gpl20 are described below. 

DNA Encoding GNE, and GNE | ^ 00120 
and the resultant proteins 
30 The present invention also provides novel DNA 

sequences encoding gpl20 from the GNEj and GNEi^ isolates 
which can be used to express gpl20 and the resultant 
gpl20 proteins. A nucleotide sequence of less than 
about 5 kilobases (Kb) , preferably less than about 3 Kb 
35 having the nucleotide sequence illustrated in Tables 1 
and 2, respectively, encodes gpl20 from the GNEg and 
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GNE,4 isolates. The sequences of the genes and the 
encoded proteins are shown below in Tables 1-3, In 
particular, Table 1 illustrates the nucleotide sequence 
(SEQ. ID. NO. 27) and the predicted amino acid sequence 
5 (SEQ. . ID. NO. 28) of the GNEg isolate of HIV. The upper 
sequence is the coding strand. The table also 
illustrates the location of each of the restriction 
sites. 
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Table 2 ill\istrates the nucleotide sequence and 
the predicted amino acid s quence f the GNEjj isolate 

f HIV. The upper sequenc is the coding strand. The 
table also illustrates the location of each of the 
5 restriction sites. The first four pages of the table 
are from one clone of the gene and the second three 
pages of the table are from another clone of the gene. 
The sequences of the clones differ by about 2%. (The 
nucleotide sequences are SEQ. ID. NOs. 28 and 29, 
10 respectively. The amino acid sequences are SEQ. ID. 
NOs. 30 and 31, respectively.) It is noted that each 
of the sequences includes a stop codon. A gene 
sequence that encodes full length gpl20 can be made by 
repairing one of the sequences. 
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Table 3 illustrates the amino acid sequences for 
the MN, GNEg, and GNEj^ gpl20 proteins* The regions of 
the sequences having identical amino acid sec[uences are 
enclosed in boxes. 

TABLE 3 



gpl60.8.34 

gpl60.SF.16.7 

gpl60.8.24 

gpl60.SF.16.2 

gpl60.SF.16.7 



n V K 
fl V )C 



R N 



0 H L 
0 H L 
0 H L 



WG T M L L 0 
WG T M L L G 
WQ T M L I G 



L M I C S A A 
L M I C S A A 
L U I C $ A A 



e|kl wvtvyyovpvwke 
gklwvtvyygvpvwke 
k l wv t vyygvpvwke 




tFCASOAKAYDTE 
LFCASOAKAYDTE 
FCASOAKAYDTE 



HNVWATHACVPTOPHPQE 
HNVWATMACVPT0PMPQ6 
HMVWATHACVPTOPHPOe 



G|LENVTENFNUWKNMMV 
VtL ENVTENFNMWKNNMV 
ENVTENfNUWKNNUV 



gpl60.8.24 


101 


EOMHED 1 IStWOOSlKPCVKtTPLCVTLNCTO 


L K 


N 


A 


TNT 


T 


s s s 


WG 


K 


M 


E 


fl 


G 


gpl60.SF.l6,2 


101 


EOMHEOIISLWDOSIKPCVKLTPLCVTLNCTO 


A G 


H 


T 


TNT 


N 


s s s 


fl E 


K 


L 


E 


K 


G 


gpl60.SF.16.7 


kh 


EQMHEO 1 ISIWDOSLKPCVKITPI CVTINCTD 


A G 


N 


T 


TNT 


N 


s s s 


G E 


K 


I 


E 


K 


G 



gpl60.8.24 

gpl60.SF.l6.2 

gpl60.SF.16.7 

gpl60.8.24 

gpl60.SF.16.2 

gpl60.SP.16.7 

gpl60.8.24 

gpl60.SF.16.2 

gpl60.SF.16.7 

gpl60.8.24 

gpl60,SF.16.2 

QP160.SP.16.7 

gpl60.8.24 

S(pl60.SF.a6.2 

gpl60.SF.16.7 

gpl60.8.24 
gpl60.SP.16.2 
*gpl60.SF.l6.7 



151 

isi 
isi 

134 
201 
201 

244 

251 
Zl 

294 

301 
301 

344 

3S1 
3S1 



£ t K N C S F N 
£ t K N C S F N 
E I K N C S F N 



V T T S 
t T T S 
(ITS 



R 0 K M 
R D K U 
R 0 K M 



A L F 
A L F 
T A t. F 



K L 0 
K L 0 
K L D 



V P I D 

V P I 0 

V P I 0 



D R N S r R N : 
D R N 5 T R N ; 



Y R L ( S 
f Y R L IS 
H Y R I IS 



CNTSV ITOACPKVSFEP IP IH 
CNTSVITOACPKVSFEPIP IH 
CNTSV ITOACPKVSFEP I P IH 



P A G F A 
P A G F A 
P A Q F A 



L K C 
L K C 
L L K C 



R 0 K K 
N N K T 
N N E T 



F N G 
F N G 
F N G 



G P C 
P C 
G P C 



N V S T V 
N V S T V 
M V S T V 



0 C 
Q Z 
L C 



THGIRPVVSTOLLLNGSLA 
THGinPVVSTOLLLNGSLA 
THGIRPVVSTOLLLNGSLA 



E V V IRS 
E VV IRS 
E VV IRS 



N A K T 1 1 V 0 L 


N 


E 


S 


V 


E 


1 N 


N A K T 1 (VOL 


T 


E 


P 


V 


K 


1 N 


N A K T 1 t V 0 L 


K 


E 


P 


V 


K 


1 N 



CTRPNNNTR 
CTRPNNNTR 
CTRPNNNTR 



T K 
E K 
E K 



N K T I 
N K T i 
N K T I 



lOPGRAFYATG 
IGPGRAFYATG 
IGPGRAFYATG 



I I G 
f I G 
I I G 



I RQAHCNLS 
I RQAHCNLS 
I RQAHCNLS 



WN N T L 
WN N T L 
D WN N T L 



FNHSSQQDPE tVMHSFNC 
FNHSSGODPE IVMHSFNC 
FNHSSGGDPE IVIIH S FNC 



G G E F F Y C 
R G E F F Y C 
G E F F Y C 



T T 
NTT 
0 T T 



L F 
QL F 
QL F 



S T WN y 
S T WD N 
S T WN A 



T Y T WN M T E gIs 
TKV. - SNGTS 
R . n|s 



333 
401 

401 N N T 



N 0 T G R N 
T £ E N S T 
,T K E N S T 



I T L 
I T L 
I T L 



C R I K Q I 
P C R t X 0 I 
P C R t K 0 I 



ijNUWQEVGKAUYAPP I RQQ I RCSS 
ViNMWOEVQKAMYAPP IRGQI RCSS 
MWQEVGKAUYAPP I RGQ I RCSS 



gpl60.8.24 


443 


N 


I T Q L L L 


T R 0 G G 


• N 


N S 


£ T 


E 


1 


FRPOGGOMRONWRSEL YKYKVVK 


E P L G V A 


gpl60.SF.16.2 


443 


N 


1 T G L L I 


T R 0 G G 


S N 


N S 


M N 


E 


T 


FRPGGQOyRONWRSEL YKYKVVK 


E P L G V A 


gpl60.SF.16.7 


448 


N 


t T 6L L I 


T R 0 C G 


S s 


N S 


UN 


E 


T 


FRPGGQOURONWRSELYKYKVVK 


E P L G V A 



gpl60.e.24 

gpl60.SF.16.2 

gpl60.Sr,16.7 

gpl60.8.24 

gpl60.SF.l6.2 

gpl60.SF.16.7 



P T K A 
P T K A 
P T K A 



K R R V 
K R R V 
R R V 



MQREKRAVQIQAVFLGFLQAAQSTyQAAS 
V0REKRAVGIGAVFL0FL6AA6STUGAAS 
REKRAVGIGAVLFLGFLGAA GSTyGAAS 



VTLTVQARLLLSG 
ITLTVQARLLLSG 
ITLTVQARLLLSG 



S49 
548 



IVOOONNLLRAIEA 
IVOOONNLLRAfEA 
I V Q 0 ONNLLRA t EA 



0 H L L 0 L 
0 0 H L L 0 L 
0 0 H L L Q L 



VWGIKOLQARVLAVERYL 
VWGtKQLQARVLAVERYL 
TVWGIKQLOARVLAVERYL 



0 0 0 L L G 1 WG 
ROQOLLGIWG 



0 Q 0 L L G I WG 



-39- 



wo 94/28929 



PCTAJS94/0d036 



gpl60.8.24 

flpl60.SF.16-2 

gpl60,SF.16.7 

gpl60.8.24 

gpl60.SF.16.2 

gpl60.SF.16.7 

gpl60.8.24 

gpl60.SF.16.2 

gpl60.SF.16.7 



613 
6tt 



CS6KL ICTT 
CSGKl ICTT 
C S 0 K I ICTT 



PWNASWSNKSLDKIWONUTWMEWERE I 
SlVPWNASWSNKSLOKlWDNUTWMEWERei 
SfVPWNASWSMKSlOKIWDKMTWMEWERE I 



y T 8 t IV 
Y T S L I Y 
y T S L I Y 



S L I E 
T t I E 
T L I E 



ESONOQEKK 
ESONOOEKN 
ESONOQEJCN 



E 0 



E QD L 



e|l t- £L 0 
L E L 
L L E L 0 



00 WA 



KWlASlWNWF 

s L WN wr 

S L WN WfIs 



tTKWLWYIKIFIMIVQOLVOLRI 
iTttWLWYlK If lUIVQQLVOlR I 
f T K Wl wriKIFIMIVGQLV G L R I 



V F T 



vls ivnrvr 

VtS IVNRVfl 
VLS IVNRVfl 



QG 



K |q Y S P L S F 0 T 
YSPLSFOT 
GYSPLSFQT 



n L 



L P A P R 
P A P R 
L P A P fl 



0 R P 
D R P 
D R P 



E|6T 
I 



£ E E G G E 
E E £ GG E 
E E E G 6 E 



R D R 0 R S 



R 0 R S 
R 0 R S 



gpl60.8.24 

gpl60.SF.16.2 

gpl6G.SF.16.7 

gpl60.a.24 

gpl60.SF.l6.2 

gpl60.sr.l6.7 

gpl60.8.24 

gpl60.SF.16.2 

gpl60.SF.16.7 



743 
7« 
748 

793 
7» 
796 



•49 
848 



L VOGF (. A 
L V 0 G F L A 
L V 0 G F l' A 



I V W V 



L I WO 



OLRSICLFSYKRLROLIUIA 
DLRSLCLFSYHRLROLLL lA 
OLRStCLFSYKftLRDLLL lA 



R IVELLGRRGWEALKYWW 
nrVELLGRRGWEALKYWW 
RIVELLGRRGWEALKYWW 



NLIQYWIOEI.KNSAVSLLN 
NLLOYWIOELKNSAVSLLN 
NLLOYWIOELKNSAVSLLN 



AIAVAEGTORV 
A lAVAEGTORV 
A lAVAEGTDRV 



A L 



ORAYRAILHIPTRI 
VLORAVRAILHIPTRI 



ORAYRAflHIPTRI 



RQOLERALL 
ROGIERAIL 
ROGLERALL 



-40- 



wo 94/28929 



PCT/US94/06a36 



Nucleic acid sequences encoding gpl20 from GNEg and 
GNEia capable of expressing gpl20 can be prepared by 
conventional means. The nucleotide sequence can be 
5 synthesized. Alternatively, another HIV nucleic acid 
sequence encoding gpl20 can be used as a backbone and 
altered at any differing residues by site directed 
mutagenesis as described in detail in Example 1. 

In a preferred embodiment, the nucleotide sequence 

10 is present in an expression construct containing DNA 
encoding gpl20 under the transcriptional and 
translational control of a promoter for expression of 
the encoded protein. The promoter can be a eukaryotic 
promoter for expression in a mammalian cell. In cases 

15 where one wishes to expand the promoter or produce 
gpl20 in a prokaryotic host, the promoter can be a 
prokaryotic promoter. Usually a strong promoter is 
employed to provide high level transcription and 
expression. 

20 The expression construct can be part of a vector 

capable of stable extrachromosomal maintenance in an 
appropriate cellular host or may be integrated into 
host genomes. Normally, markers are provided with the 
expression construct which allow for selection of a 

25 host containing the construct. The marker can be on 
the same or a different DNA molecule, desirably, the 
same DNA molecule. 

The expression construct can be joined to a 
replication system recognized by the intended host 

30 cell. Various replication systems include viral 

replication systems such as retroviruses, simian virus, 
bovine papilloma virus, or the like. In addition, the 
construct may be joined to an amplifiable gene, e.g. 
DHFR gene, so that multiple copies of the gpl20 DNA can 

35 be made. Introduction of the construct into the host 
will vary depending on the construct and can be 
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achieved by any convenient means. A wide variety of 
prokaryotic and exDcary tic hosts can be employed for 
expression of the proteins. 

Preferably, the gpl20 is expressed in mammalian 
5 cells that provide the same glycosylation and disulfide 
bonds as in native gpl20. Expression of gpl20 and 
fragments of gpl20 in mammalian cells as fusion 
proteins incorporating N-terminal sequences of Herpes 
Simplex Virus Type 1 (HSV-l) glycoprotein D (gD-1) is 

10 described in Lasky, L. A. et al., 1986 (Neutralization 
of the AIDS retrovirus by antibodies to a recombinant 
envelope glycoprotein) Science 233: 209-212 and Haffar, 
O.K. et al., 1991 (The cytoplasmic tail of HIV-1 gpl60 
contains regions that associate with cellular 

15 membranes.) Virol. 180:439-441, respectively. A 

preferred method for expressing gpl20 is described in 
Example 3. In the example, a heterologous signal 
sequence was used for convenient expression of the 
protein. However, the protein can also be expressed 

20 using the native signal sequence. 

An isolated, purified GNE8-gpl20 and GNEi6-gpl20 
having the amino acid sequence illustrated in Tables 
1-3 can be produced by conventional methods. For 
example, the proteins can be chemically synthesized. 

25 In a preferred embodiment, the proteins are expressed 
in mammalian cells using an expression construct of 
this invention. The expressed proteins can be purified 
by conventional means. A preferred purification 
procedxire is described in Example 3. 

30 

qpl20 Fragments 
The present invention also provides gpl20 
fragments that are suitable for use in inducing 
antibodies for use in serotyping or in a vaccine 
35 formulation. A truncated gpl20 sequence as used herein 
is a fragment of gpl20 that is free from a portion of 
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the intact gpl20 sequence beginning at either the amino 
or carboxy terminus of gpl20. A truncated gpl20 
sequenc of this invention is free from the C5 domain. 
The C5 domain of gpl20 is a major immunogenic site of 
5 the molecule. However, antibodies to the region do not 
neutralize virus. Therefore, elimination of this 
portion of gpl20 from immunogens used to induce 
antibodies for serotyping is advantageous. 

In another embodiment, the truncated gpl20 

10 sequence is additionally free from the carboxy terminus 
region through about amino acid residue 4 53 of the 
gpl20 V5 domain. The portion of the V5 domain 
remaining in the sequence provides a convenient 
restriction site for preparation of expression 

15 constructs. However, a truncated gpl20 sequence that 
is free from the entire gpl20 V5 domain is also 
suitable for use in inducing antibodies. 

In addition, portions of the carboxy terminus of 
gpl20 can also be eliminated from the truncated gpl20 

20 sequence. The truncated gpl20 sequence can 

additionally be free from the gpl20 signal sequence. 
The truncated gpl20 sequence can be free from the 
carboxy terminus through amino acid residue 111 of the 
gpl20 01 domain, eliminating most of the CI domain but 

25 preserving a convenient restriction site. However, the 
portion of the CI domain through the cysteine residue 
that forms a disulfide bond can additionally be 
removed, so that the truncated gpl20 sequence is free 
from the carboxy terminus through amino acid residue 

30 117 of the gpl20 CI domain. Alternatively, the 

truncated gpl20 sequence can be free from the amino 
terminus of gpl20 through residue 111 of the CI domain, 
preserving the V2 disulfide bond. In a preferred 
embodiment, the truncated gpl20 sequence is free from 

35 the amino terminus of gpl20 through residue 111 of the 
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CI domain and residue 453 through the carboxy terminus 
of gpl20. 

The truncated gpl20 sequences can be produced by 
recombinant engineering, as described previously. 
5 Conveniently, DNA encoding the truncated gpl20 sequence 
is joined to a heterologous DNA sequence encoding a 
signal sequence. 

Serotvpina method 

10 The present invention also provides an improved 

serotyping method for HIV strains. The method 
comprises determining the serotypes of the V2, V3, and 
C4 domains of gpl20. 

HIV isolates can be serotyped by conventional 

15 immunoassay methods employing antibodies to the 

neutralizing epitopes in the V2, V3, and C4 domains for 
various strains of HIV. Preparation of the antibodies 
is described hereinbefore. The antibody affinity 
required for serotyping HIV using a particular 

20 immunoassay method does not differ from that required 
to detect other polypeptide analytes. The emtibody 
composition can be polyclonal or monoclonal, preferably 
monoclonal. 

A number of different types of immunoassays are 
25 well known using a variety of protocols and labels. 
The assay conditions and reagents may be any of a 
variety found in the prior art. The assay may be 
heterogeneous or homogeneous. Conveniently, an HIV 
isolate is adsorbed to a solid phase and detected with 
30 antibody specific for one strain of neutralizing 

epitope for each neutralizing epitope in the V2, V3, 
and C4 domain. Alternatively, supernatant or lysate 
from the cultured isolate which contains gpl20 can be 
adsorbed to the solid phase. The virus or gpl20 can be 
35 adsorbed by many well known non-specific binding 
methods. Alternatively, an anti-gpl20 antibody. 
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preferably directed to the carboxy tenninus of gpl20 
can be used to affix 9pl20 to the solid phase* A gpl20 
capture antibody and sandwich ELISA assay f r gpl20 
neutralizing epitopes is described by Moore, AIDS Res. 
5 Hum, Retroviruses 9:209-219 (1993), Binding between 

the antibodies and sample can be determined in a number 
of ways. Complex formation can be determined by use of 
soluble antibodies specific for the anti-gpl20 
antibody. The soluble antibodies can be labeled 

10 directly or can be detected using labeled second 

antibodies specific for the species of the soluble 
antibodies. Various labels include radionuclides, 
enzymes, fluorescers, colloidal metals or the like. 
Conveniently, the anti-gpl2 0 antibodies will be labeled 

15 directly, conveniently with an enzyme. 

Alternatively, other methods for determining the 
neutralizing epitopes can be used. For example, 
fluorescent-labeled antibodies for a neutralizing 
epitope can be combined with cells infected by the 

20 strain of HIV to be serotyped and analyzed by 
fluorescence activated cell sorting. 

The serotype of the HIV isolate includes the 
strain of the neutralizing epitopes for the V2, V3, and 
C4 domains. 

25 

It is understood that the application of the 
teachings of the present invention to a specific 
problem or situation will be within the capabilities of 
one having ordinary skill in the art in light of the 

30 teachings contained herein. Examples of the products 
of the present invention and representative processes 
for their isolation, use, and manufacttire appear below, 
but should not be construed to limit the invention. 
All literatxire citations herein are expressly 

35 incorporated by reference. 
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EXAMPLE 1 

Identification of C4 Neutralizing Epitopes 
The following reagents and methods were used in 
the studies described herein. 

5 

gpl20 sequences and nomenclature. Amino acid 
residues are designated using the standard single 
letter code- The location of amino acids within the 
gpl20 protein is specified using the initiator 

10 methionine residue as position 1. The designation LAI 
is used to describe the virus isolate from which the 
HIV-1b„jo/ HIV-linB, HIV-lgRur HIV-I^, HIV-1hxb3 and 
HIV- Ihxbio substrains (molecular clones) of HIV-l were 
obtained. The sequence of gpl20 from IIIB substrain of 

15 HIV-lj^j is that determined by Muesing et al. (30). 

The sequence of gpl20 from MN strain of HIV-l is 
given with reference to the MNgpl20 clone (MNone) • The 
sequence of this clone differs by approximately 2% from 
that of the MN,9g4 clone described by Gurgo et al . (13). 

20 The sequences of gpl20 from the NY-S, JRcsf, Z6, Z221f 
and HXB2 strains of HIV-l are those listed by Myers et 
ai. (32) except where noted otherwise. The sequence of 
the Thai isolate A244 is that provided by McCutchan et 
al. (24). The variable (V) domains and conserved (C) 

25 domains of gpl20 are specified according to the 
nomenclature of Modrow et al. (28). 

Monoclonal antibody production and screening 
assays* Hybridomas producing monoclonal antibodies to 

30 MN-rgpl20 ( recombinant ly produced gpl20 from the MN 
strain of HIV) (3) were prepared and screened for CD4 
blocking activity as described previously (7, 33). The 
binding of monoclonal antibodies to MN-rgpl20 and to 
rgpl20s from the IIIB, NY-5, 26, Z321, JRcsf, and A244 

35 strains of HIV-l was assessed by enzyme linked 
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immunoadsorbant assays (ELISA) as described previously 
(33). 

Virus binding and neutralization assays. The 

5 ability of monoclonal antibodies to neutralize HIV-1 

infectivity in vitro was assessed in a colorimetric MT- 
2 cell cytotoxicity assay similar to that described 
previously (35) . MT-2 cells and H9/HTLV-IIImn cells 
were obtained through the AIDS Research and Reference 

10 Reagent Program, Division of AIDS, NIAID, NIH: 

contributed by Drs. Douglas Richman and Robert Gallo, 
respectively. Briefly, serial dilutions of antibody or 
serum were prepared in 50 /xl volumes of complete and 
then 50 fil of a prediluted HIV-1 stock was added to 

15 each well. After incubation for 1 hr at 4*'C, 100 ^1 of 
a 4 X 10^ MT-2 cell/ml suspension was added. After 
incubation of the plates for 5 days at 37*0 in 5% CO3, 
viable cells were measured using metabolic conversion 
of the formazan MTT dye. Each well received 20 fil of a 

20 5 mg/ml MTT solution in PBS. 

After a 4 hr incubation at 37 •C, the dye 
precipitate was dissolved by removing 100 fil of the 
cell supernatant, adding 130 ^tl of 10% Triton X-100 in 
acid isopropanol, then pipeting until the precipitate 

25 was dissolved. The optical density of the wells was 
determined at 540 nm. The percentage inhibition was 
calculated using the formula: 

l-r virus control-exper imental^ 
(virus control -medium control) 

30 

Cell surface staining of HIV-1 infected colls with 
monoclonal antibodies. H9 cells (2 x 10*) chronically 
infected with the IIIB, HXB2, HXB3, and HXIO substrains 
of HIV-li^j or with HIV-Imn were incubated for 30 min at 
35 room temperature with monoclonal antibodies (10 ng per 
ml) in 100 fil of RPMI 164 0 cell culture media 
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containing 1% FCS. Cells were washed and then 
incixbated with 20 fig per ml of f luorescein-conjugated, 
affinity-purified, goat antibody to mouse IgG (Fab*)2 
(Cappel, West Chester, PA) for 30 min. Cells were 
5 washed, fixed with 1% paraformaldehyde and the bound 
antibody was quantitated by flow cytometry using a 
FACSCAN (Becton-Dickenson, Fullerton, CA) . 

Fluorescence data was expressed as percentage of 
fluorescent cells compared to the fluorescence obtained 
10 with the second antibody alone. Fluorescence was 

measured as the mean intensity of the cells expressed 
as mean channel number plotted on a log scale. 

Fracfmentation of the MN-rgpl20 gene. Fragments of 

15 the MN-rgpl20 gene were generated using the polymerase 
chain reaction (PGR) (17). Briefly, forward 30-mer 
oligonucleotide DNA primers incorporating a Xho 1 site, 
and reverse 3 6-mer oligonucleotide DNA primers 
containing a stop codon followed by a Xba 1 site were 

20 synthesized and used for the polymerase chain 

reactions. Thirty cycles of the PGR reaction were 
performed using 0.3 ;ig of a plasmid containing the gene 
for gpl20 from the MN strain of HIV-1 (pRKMN. D533) and 
0.04 nM of a designated primers. The PGR reaction 

25 buffer consisted of 0.1 M Tris buffer (pH 8.4), 50 mM 
KCl, 0.2 mM 4dNTP (Pharmacia, Piscataway, NJ) , 0.15 M 
MgClj and 0.5 Unit of Tag Polymerase (Perkin-Elmer 
Getus, Norwalk, GN) and a typical PGR cycle consisted 
of a 60 second denaturation step at 94**G, followed by a 

30 45 second annealing step at 55*» G, and then an 
extension step at 72** G for 45 seconds. 

Following the PGR amplification, the PGR products 
were purified by phenol and chloroform extraction, and 
then ethanol precipitated. The purified products were 

35 then digested with the restriction endonucleases Xhol 

and Xbal. The resulting PGR products were gel purified 



-48- 



wo 94/28929 



PCT/US94/06036 



using 1% agarose (SEAKEM, FMC Bioproducts, Rockland, 
ME) or 5% polyacrylamide gel electr ph resis (PAGE) and 
then isolated by electroelution. 

5 Site directed mutagenesis of the KN-rgpl20 C4 

domain. A recombinant PGR technique (15) was utilized 
to introduce single amino acid substitutions at 
selected sites into a 600 bp Bgl II fragment of 
MN-rgpl20 that contained the C4 domain. This method 

10 entailed the PGR amplification of overlapping regions 
of the C4 domain of gpl20 using primers that 
incorporated the desired nucleotide changes. The 
resultant PGR products were then annealed and PGR 
amplified to generate the final product. For these 

15 reactions 18-mer "outside" primers encoding the wild 

type sequence (Bgl II sites) were amplified with 36-mer 
"inside" primers that contained the alanine or glutamic 
acid residue changes. The first PGR reaction included 
IX of the Vent polymerase buffer (New England Biolabs, 

20 Beverly, MA), 0.2 mM of 4dNTP (Pharmacia, Piscataway, 
N.J.), 0.04 nM of each synthetic oligonucleotide, 0.3 
^g of linearized plasmid, pRKMN.D533, which contained 
the MN-rgpl20 gene. Thirty PGR cycles were performed 
consisting of the following sequence of steps: 45 

25 seconds of denaturation at 94 •G, 45 second of annealing 
at 55'C and 45 seconds of extension at 72**G. Following 
PGR amplification, the product pairs were gel purified 
using a 1% solution of low melt agarose (SeaPlaque, FMG 
Bioproducts, Rockland, ME) . 

30 The agarose containing PGR product was melted at 

65 ®G and combined with the PGR product of the 
overlapping pair and equilibrated to 37 «G. Added to 
this (20 fil) was 10 ^1 of lOX Vent Polymerase buffer, 
10 /xl of 2 mM 4dNTP, 0.04 nM each of the "outside" wild 

35 type 18 mer oligonucleotides, 57 ^1 of HjO and 1 unit of 
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Vent Polymerase. Thirty PGR cycles were performed as 
previously above. 

The resulting PGR products were purified and 
digested with the Bgl II endonuclease. The digested 
PGR product was then ligated into the mammalian cell 
expression vector pRKMN.D533; which had been digested 
with Bgl II allowing for the removal of a 600 bp 
fragment. Colonies containing the correct insertion 
were identified and Sequenase 2.0 supercoil sequencing 
was employed to check for fidelity and the 
incorporation of the desired mutation. 

Expression of gpX20 fragments in mammalian cells. 

Fragments of the MN and IIIB gpl20 were expressed in 
mammalian cells as fusion proteins incorporating 
N-terminal sequences of Herpes Simplex Virus Type 1 
(HSV-1) glycoprotein D (gD-^l) as described previously 
(14, 22). Briefly, isolated DNA fragments generated by 
the PGR reaction were ligated into a plasmid (pRK*gD-l) 
designed to fuse the gpl20 fragments, in frame, to the 
5' sequences of the glycoprotein D (gD) gene of Type 1 
Herpes Simplex Virus (gD-l)and the 3» end to 
translational stop codons. The fragment of the gD-l 
gene' encoded the signal sequence and 25 amino acids of 
the mature form of HSV-1 protein. To allow for 
expression in mammalian cells, chimeric genes fragments 
were cloned into the pRK5 expression plasmid (8) that 
contained a polylinker with cloning sites and 
translational stop codons located between a 
cytomegalovirus promoter and a simian virus 40 virus 
polyadenylation site. 

The resulting plasmids were transfected into the 
293s embryonic human kidney cell line (12) using a 
calcium phosphate technique (11) . Growth conditioned 
cell cultxxre media was collected 48 hr after 
transfection, and the soluble proteins were detected by 
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ELISA or by specific radioinununoprecipitation where 
metabolically labeled proteins from cell culture 
supernatants were resolved by sodivim dodecyl sulfate 
polyacrylamide gel electrophoresis (PAGE) and 
5 visualized by autoradiography as described previously 
(1, 18). 

Radioimmunoprecipitation of MN-rgpl2 0 mutants. 

Plasmids directing the expression of the MN-rgpl20 C4 

10 domain mutants were transfected into 293s cells as 
described above. Twenty four hours following the 
transfection, the cells were metabolically labeled with 
[^*S] -labeled methionine or cysteine as described 
previously (1) . The labeled cell culture supernatants 

15 were then harvested and 0.5 ml aliquots were reacted 

with 1-5 ^g of the monoclonal antibody or with 2 /xl of 
the polyclonal rabbit antisera to MN-rgpl20 and 
immunoprecipitated with Pansorbin (CalBiochem, La 
Jolla, CA) as described previously (1) . The resulting 

20 Pansorbin complex was pelleted by centrifugation, 

washed twice with a solution containing PBS, 1% NP-40 
and 0.05% SDS and then boiled in a PAGE sample buffer 
containing 1% 2-mercaptoethanol. The processed samples 
were the analyzed by SDS-PAGE and visualized by 

25 autoradiography (1, 18). 

Assays to measure the binding of monoclonal 
antibodies to mutagenized MN-rgpl20 polypeptides. An 

ELISA was developed to screen for reactivity of 
30 MN-rgpl20 fragments and mutant proteins with veurious 
monoclonal antibodies. In this assay, 96 well 
microtiter dishes (Maxisorp, Nunc, Roskilde, Denmark) 
were coated overnight with mouse monoclonal antibody 
(5B6) to gD-1, at a concentration of 2.0 /xg/ml in 
35 phosphate buffered saline (PBS) . The plates were 

blocked in a PBS solution containing 0.5% bovine serxam 
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albumin (PBS A) and then incubated with growth 
conditi ned cell cultur m dium fr la transfected cells 
expressing the recombinant gpl20 variants for 2 hr at 
room temperature. The plates were washed three times 
5 in PBS containing 0.05% Tween 20 and then incubated 
with the purified, HRPO-conjugated monoclonal 
antibodies. Following a 1 hr incubation, the plates 
were washed three times and developed with the 
colorimetric substrate, o-phenylenediamine (Sigma, St. 

10 Louis, MO) . 

The optical densities in each well were then read 
in a microtiter plate reading spectrophotometer at 
492 nm. Each cell culture supernatant containing 
fragments or mutated rgpl20s was normalized for 

15 expression based on the titering of its reactivity to 
the V3 binding monoclonal antibody 1034 or to a rabbit 
polyclonal antisera to MN-rgpl20. Data from these 
experiments were expressed as a ratio of the optical 
densities obtained with the CD4 blocking monoclonal 

20 antibodies binding to the fragments or MN-rgpl20 
mutants compared with the full length wild type 
rgpl20s. 

To normalize for different concentrations of 
MN-rgpl20-derived protein in the cell cultxire 

25 supernatants , the binding of the CD4 blocking 

monoclonal antibodies to each preparation was compared 
to that of an HRPO-conjugated monoclonal antibody to 
the V3 domain of MN-rgpl20 (1034). Data from these 
experiments were expressed as a ratio of the optical 

30 densities obtained with the CD4 blocking monoclonal 
antibodies to the HRPO conjugated V3 reactive 
monoclonal antibody. 



CD4 binding assays. The ability of monoclonal 
35 antibodies to inhibit the binding of MN-rgpl20 to 

recombinant soluble CD4 (rsCD4) was determined in a 
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solid phase radlolinznunoassay similar to that: described 
previously (33). The effect of single amino acid 
substitutions on the binding of MN-rgpl20 mutants to 
CD4 was determined In a co-lmmunoprecipltatlon assay 
similar to that described previously (21) • Briefly, 
293 cells were metabolically labeled with ^^S-methionlne 
24 hr after transfection with plasmids expressing 
MN-rgpl20 variants. Growth conditioned cell culture 
medium (0.5 ml) was then incubated with 5.0 /ig of 
recombinant sCD4 for 90 minutes at room temperature. 
Following this incubation, 5.0 /ig of an anti-CD4 
monoclonal antibody (465) , known to bind to an epitope 
remote from the gpl20 binding site, was added and 
allowed to incubate another 90 minutes at room 
temperature . 

The gpl20-CD4 -antibody complexes were precipitated 
with Pansorbln that had been washed with PBS, 
preabsorbed with 0.1% bovine serum albumin and then 
bound with 50 fig of an affinity ptirified rabbit anti- 
mouse IgG (Cappel, West Chester, PA) . The pellet was 
washed twice with PBS 1% NP-40, 0.05% SDS, and then 
boiled in beta mercaptoethanol containing SDS-PAGE 
sample buffer. The immunoprecipitation products were 
resolved by SDS PAGE and visualized by autoradiography 
as described previously (1, 21). 

Antibody affinity measurements. Antl-gpl20 
antibodies were iodlnated with Na '^I with lodogen* 
(Pierce, Rockford, IL) . Briefly, 50 fig of antibody In 
PBS was placed in 1.5 ml polypropylene microcentrifuge 
tubes coated with 50 fig of lodogen. Two mllllcuries of 
carrier free Na[^^I] was added. After 15 min. , free ^^I 
was separated from the labeled protein by 
chromatography on a PD-10 column (Pierce, Rockford, IL) 
pre-equilibrated in PBS containing 0.5% gelatin. 
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Antibody concentrations following iodination were 
determined by ELISA to calculate specific activities. 

For binding assays, 96-well microtiter plates were 
coated with 100 ;il/well of a 10 ptg/m solution of 
5 MN-rgpl20 or IIIBrgpl20 in 0.1 M bicarbonate buffer, 
pH 9.6 and incubated for 2 hr at room temperature or 
overnight at 4**C. To prevent non-specific binding, 
plates were blocked for 1-2 hr at room temperature with 
200 ;il/well of a gelatin solution consisting of PBS 

10 containing 0.5% (wt/vol) gelatin and 0.02% sodixim 

azide. Unlabeled anti-gpl2 0 monoclonal antibody (0 to 
400 nM) was titrated (in duplicate) in situ and 
radiolabeled antibody was added to each well at a 
concentration of 0.5 nM. 

15 After a 1-2 hr incubation at room temperature, the 

plate was washed lOx with the PBS/ 0.5% gelatin/0. 02% 
azide buffer to remove free antibody. The antibody- 
gpl20 complexes were solubilized with 0.1 N NaOH/0.1% 
SDS solution and counted in a gamma counter. The data 

20 were analyzed by the method of Scatchard (40) using the 
Ligand analytical software program (31) . values 
reported represent the means of four independent 
determinations . 

25 RESULTS 

Characterization of monoclonal antibodies to HN-rgpl20 
that blocX CD4 binding. Monoclonal antibodies prepared 
from mice ixnmunized with MN-rgpl20 (3, 33), were 
screened for the ability to bind to MN^rgpl20 coated 

30 microtiter dishes by ELISA as described previously 

(33). Of the thirty five clones obtained, seven were 
identified (1024, 1093, 1096, 1097, 1110, 1112, and 
1127) that were able to inhibit the binding of 
MN-rgpl20 to recombinant CD4 in ELISA (Figure 1) or 

35 solid phase or cell surface radioimmunoassays (21, 33). 
Previous studies have shown that two distinct classes 
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of CD4 blocking monoclonal antibodies occiur: those that 
bind to conformation dependent (discontinuous) epitopes 
(16, 26, 33, 35, 45) and those that bind to 
conformation independent (sequential) epitopes (4, 7, 
21, 33, 43). 

To distinguish between these two alternatives, the 
binding of the monoclonal antibodies to denatured 
(reduced and carboxymethylated) MN-rgpl20 (RCM-gpl20) 
was measured by ELISA as described previously (33) . As 
illustrated in Table 4, below, it was found that all of 
the CD4 blocking monoclonal antibodies reacted with the 
chemically denatured protein; indicating that they all 
recognized conformation independent (sequential) 
epitopes. 
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Table 4 

Properties of monoclonal antibodies t MN-rgpl20 





CD4 


HIV— 1 mn 












Inhi- 


Neutral- 


HIV-1 mn 


CM- 


Domain 


croaa 






ization 








.i^^aq^ivi^ty 


1024 


+ 










2 


1093 








+ 




2 


1096 




+ 








2 


1097 


+ 


+ 




+ 




2 


1110 


+ 


+ 




+ 




2 


1112 


+ 


+ 




+ 




2 


1127 


+ 










2 


1026 












1,2,3,4,6 


1092 








+ 




1,2,3,4,5 


1126 








+ 




1,2,3,5,7 


1086 








+ 




2 


13H8 


+ 






+ 


1,3 


1,2,3,4,5,6,7 



20 

rgpl20 cross reactivity: 1, IIIB-rgl20; 2, MN-'rgpl20, 
3, NYS-rgpl20; 4, JrCSF-rgpl20; 5, Z6-rgpl20; €, Z321- 
rgpl20; 7, A244-rgpl20 

25 

C4 domain peptides: 

1, FINMWQEVGKAMYAPPIS (SEQ. ID. NO. 24); 

2, MWQEVGKAMYAP (SEQ. ID. NO. 25) ; 

3, GKAMYAPPIKGQIR (SEQ. ID. NO. 26) 

30 

The cross reactivity of these monoclonal 
antibodies was assessed by ELISA as described 
previously (33). In these experiments, the ability of 
the monoclonal antibodies to bind to a panel of seven 

35 rgpl20s, prepared from the IIIB, MN, 26, 2321, NY-5, 
A244, and JRcsf isolates of HIV-1, was meastured by 
ELISA (33). It was found that all of the CD4 blocking 
monoclonal antibodies were strain specific and bound 
only to gpl2*0 from the MN strain of HIV-1 (Table 4) . 

40 However, other antibodies from the same fusion 
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(1026,1092, and 1126) exhibited much broader cross 
reactivity (Table 4, Figure 2), as did a CD4 blocking 
monoclonal antibody to IIIB-rgpl20 (13H8) described 
previously (33). 
5 Further studies were performed to characterize the 

neutralizing activity of the antibodies to MN-rgpl20. 
In these studies, monoclonal antibodies were incubated 
with cell free virus (HIV-Im^) , and the resulting 
mixture was then used to infect MT-2 cells in 

10 microtiter plates. After 5 days, the plates were 

developed by addition of the color imetric dye, MTT, and 
cell viability was measured spectrophotometrically • It 
was found (Table 4, Figure 2) that all of the CD4 
blocking monoclonal antibodies were able to inhibit 

15 viral infectivity. However the potency of the 

monoclonal antibodies varied considerably with some 
monoclonal antibodies (eg. 1024) able to inhibit 
infection at very low concentrations (IC50 of 0.08 fig 
per ml) whereas other monoclonal antibodies (eg. 1112) 

20 required much higher concentrations (IC50 of 30 ^^9 P^^ 
ml) . In control experiments two monoclonal antibodies 
to MN-rgpl20 from the same fusion (eg. 1086, 1092) were 
ineffective, whereas the 1026 monoclonal antibody 
exhibited potent neutralizing activity. Similarly, 

25 monoclonal antibodies to the V3 domain of IIIB-rgpl20 

(10F6, 11G5) known to neutralize the infectivity HlV-lnm 
(33) , were unable to neutralize the HIV-Imn virus* 

Binding studies using synthetic peptides were then 
performed to further localize the epitopes recognized 

30 by these monoclonal antibodies as described previously 
(33) . When a peptide corresponding to the V3 domain 
(3) of MN-rgpl20 was tested, it was found that none of 
the CD4 blocking antibodies showed any reactivity. 
However the epitope recognized by the non-CD4 blocking 

35 monoclonal antibody, 1026, prepared against MN*rgpl20 
could be localized to the V3 domain by virtue of its 
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binding to this peptide. In other experiments, three 
synthetic peptides from the C4 domain of gpl20 that 
incorporated sequences recognized by the CD4 blocking, 
weakly neutralizing monoclonal antibodies described by 
5 McKeating et al. (26) were tested (Table 4). It was 
found that none of the CD4 .blocking monoclonal 
antibodies to MN-rgpl20 reacted with these peptides, 
however the non-neutralizing, CD4 blocking 13H8 
monoclonal antibody bound to the peptides corresponding 

10 to residues 423-440 of IIIB-gpl20 and residues 431-441 
of MN-gpl20, but not to that corresponding to residues 
426-437 of IIIB-gpl20. Thus the 13H8 monoclonal 
antibody recognized a epitope that was similar, if not 
identical, to that described by McKeating et al. (26). 

15 This result is consistent with the observation that the 
13H8 monoclonal antibody and the monoclonal antibodies 
described by Cordell et al. (4) and McKeating et al. 
(26) exhibited considerable cross reactivity, whereas 
the antibodies to MN-rgpl20 were highly strain 

20 specific. 

CD4 blocking antibodies recognize epitopes in the 
C4 domain. Previously, a strain specific, CD4 blocking 
monoclonal antibody (5C2) raised against IIIB-rgpl20 

25 was found to recognize an epitope in the 04 domain of 
IIIB-rgpl20 (21, 33). Although the 5C2 monoclonal 
antibody was able to block the binding of rgpl20 to 
CD4, it was unable to neutralize HIV-l infectivity in 
vitro (7) . Affinity columns prepared from 5C2 adsorbed 

30 an 11 amino acid peptide (residues 422 to 432) from a 
tryptic digest of gpl20 (21) , however monoclonal 
antibody 5C2 was xxnable to recognize this peptide 
coated onto wells of microtiter dishes in an ELISA 
format (Nakamura et al., unpublished results), 

35 To determine whether the CD4 blocking monoclonal 

antibodies raised against MN-rgpl20 recognized the 
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. corresponding epitope in the C4 domain of MN-rgpl20, a 
series of overlapping fragments, spanning the V4 and C4 
domains of HIV-Im^ gpl20, were prepared for expression 
in mammalian cells. A diagram of the fragments 
5 expressed is shown in Figures 3A and 3B, The C4 domain 
fragments were expressed as fusion proteins that 
incorporated the signal sequence and amino terminal 25 
amino acids of HSV-1 glycoprotein D as described above. 
Plasmids directing the expression of the chimeric 

10 C4 domain fragments were transfected into 293 cells, 
and their expression was monitored by 
radioimmunoprecipitation studies where a monoclonal 
antibody, 5B6, specific for the mature amino terminus 
of glycoprotein D was utilized. It was found 

15 (Figure 3B) that all of the fragments were expressed 
and exhibited mobilities on SDS-PAGE gels appropriate 
for their size. Thus fMN. 368-408 (lane 1) exhibited a 
mobility of 19 kD; fMN. 368-451 (lane 2) exhibited a 
mobility of 29 kD; fMN. 419-433 (lane 3) exhibited a 

20 mobility of 6 kD, and fMN. 414-451 (lane 4) exhibited a 
mobility of 6.1 kD. 

The binding of monoclonal antibody 1024 to the 
recombinant fragments was then determined by ELISA (as 
described in Example 1) . It was found (Figure 3A) that 

25 monoclonal antibody 1024 reacted with the fragments 

that contained the entire C4 domain of MN-rgpl20 (fMNj^i. 
431/ fMN40M55) f but failed to bind to a fragment derived 
from the adjacent V4 domain (fMN3^^) or to another 
fragment that contained V4 domain sequences and the 

30 amino terminal half of the C4 domain (fMNj^^js) • The 

fact that 1024 bound to the fMN4i445i and fMN4,^3 fragments 
demonstrated that the epitopes recognized by all of 
these monoclonal antibodies were contained entirely 
between residues 419 and 443 in the C4 domain. 
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Resldu s recognized by m n clonal antibodies that 
block binding of MM-rgpl20 t CD4. To identify 
specific amino acid residues that might be part of the 
epitopes recognized by these monoclonal antibodies, the 
5 sequence of the C4 domain of MN~rgpl20 was compared to 
those of the gpl20s from the six other rgpl20s that 
failed to react with the CD4 blocking monoclonal 
antibodies (Figure 4). It was noted that the sequence 
of MN-rgpl2 0 was unique in that K occurred at position 

10 429 whereas the other rgpl20s possessed either E,G, or 
R at this position. Another difference was noted at 
position 440 where E replaced K or S. To evaluate the 
significance of these substitutions, a series of point 
mutations were introduced into the MN-rgpl20 gene 

15 (Figure 5) . Plasmids expressing the mutant proteins 
were transfected into 293s cells, and expression was 
verified by radioimmunoprecipitation with a monoclonal 
antibody (1034) directed to the V3 domain of 
MN-rgpl20. Cell culture supernatants were harvested 

20 and used for the monoclonal antibody binding studies 
shown in Table 6. To verify expression, radio- 
immunoprecipitation studies using cell culture 
supernatants from cells metabolically labeled with 
[^^]S-methionine were performed using the 1024 

25 monoclonal antibody specific for the C4 domain of 

MN-rgpl20 (A) or the 1034 monoclonal antibody specific 
for the V3 domain of MN-rgpl20. Immune complexes were 
precipitated with the use of fixed S. aureus and the 
adsorbed proteins were resolved by SDS-PAGE. Proteins 

30 were visualized by autoradiography. The samples were: 
Lane 1, MN.419A; lane 2 MN.421A; lane 3 MN.429E; lane 
4, MN.429A; lane 5, MN,432A; lane 6, MN.440A; lane 7, 
MN-rgpl20. The immunoprecipitation study showed that 
1024 antibody binds well to all the variants except 3 

35 and 4 which are mutated at residue 429, 1034 antibody 
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was used as a control and precipi-tates with anti-V3 
antibodies. 

The effect of these mutations on the binding of 
the CD4 blocking monoclonal antibodies was then 
5 evaluated by ELISA as illustrated in Table 5, below. 

Table 5 

Binding of CD4 blocking monoclonal 
antibodies to C4 domain mutants 

10 



Proteins/ 
MAbs 


1024 


1093 


1096 


1097 


1110 


1112 


1127 


5C2 


MN-rgpl20 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


1.0 


0.05 


MN-419A 


1.11 


1.10 


0.94 


1.21 


0.78 


0.95 


1.10 


ND 


MN-421A 


1.11 


1.60 


0.88 


1.42 


1.34 


0.91 


1.10 


ND 


MN-429E 


0.03 


0.07 


0.11 


0,04 


0.10 


0.10 


0.02 


ND 


HN-429A 


0.10 


0.07 


0.14 


0.04 


0.09 


0.11 


0.05 


ND 


MN-432A 


0.77 


0.15 


0.59 


0.08 


0.12 


0.24 


0.26 


ND 


MN-440A 


1.06 


1.13 


1.08 


0.87 


1.12 


1.0 


1.3 


ND 


IIIB-rgpl20 


0.03 


ND 


ND 


ND 


ND 


ND 


ND 


1.0 


MN-423F 


ND 


HO 


ND 


ND 


ND 


ND 


ND 


0.45 


MN-423F,429E 


ND 


ND 


ND 


ND 


ND 


ND 


ND 


1.09 



25 Data represent the relative binding of HAbs to the native and 
mutant fonns of r9pl20. Values were calculated by dividing the 
binding (determined by ELISA) of the CD4 blocking HAbs to the 
proteins indicated by the values obtained for the binding of a V3 
specific NAb (1034) to the same proteins (as described in 

30 Example 1). 

It was found that replacement of ^440 with an A 
residue (HN.440A)had no effect on the binding of the 
1024 monoclonal antibody or any of the other CD4 

35 blocking monoclonal antibodies (Table 5) . The 

significance of K at position 429 was then evaluated by 
substitution of either A (MN.429A) or E (MN.429E) at 
this location. It was found that the A for K 
substitution at position 429 {MN.420A) markedly reduced 

40 the binding of the 1024 monoclonal antibody and all of 
the other CD4 blocking monoclonal antibodies (Table 5) . 



-61- 



wo 94/28929 



PCT/US94/06036 



Similarly, the replacement of E for K (MN.429E) at this 
p sition totally abr gated the binding of the 1024 
monoclonal antibody and all of the other CD4 blocking 
monoclonal antibodies (Table 5) . Several other mutants 
5 were constructed to evaluate the role of positively 

charged residues in the C4 domain. It was found that A 
for K substitutions at positions 419 (MN.419A) and 
421(MN.421A) failed to interfere with the binding of 
any of the CD4 blocking monoclonal antibodies as 
10 illustrated in Table 6, below. 

Table 6 

Correlation Between Antibody Binding Affinity 
and Virus Neutralizing Activity 

15 



MAb 


Block 


K,, nM* 


l£5a 


1024" 


+ 


2.7 ± 0.9 


0.4 


1086*-' 




9.7 ± 2.2 




1093* 




9.9 ± 2.6 


3.3 


1096* 


+ 


10 ± 6 


12 


1097*^ 


+ 


13.4 ± 3.7 


12 


1110* 


+ 


12.1 ± 1.7 


12 


1112* 


+ 


20 ± 4.4 


200 


1127* 


+ 


9.3 ± 4 


3.3 


1086**' 




9.7 ± 2.2 




laHs'-* 




22 + 6 





* Blocked binding of rgpl20 MN to CD4. 

* Blocked binding of rgpl20 Illb, not rgpl20 MN, to 
30 CD4. 

' Mean of four determinations calculated using the 

method of Scatchard (40) . 

' Neutralization of HIV-l^o< infectivity in vitro. 

* Anti-rgpl20 MN antibody. 

35 ' Did not neutralize HIV-l infectivity. 

« Anti-rgpl20 Illb antibody. 
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However, when K at position 432 was replaced with 
A (MN432.A), the binding of all of the CD4 blocking 
antibodies was markedly reduced (Table 5) . 
Interestingly, the binding of monoclonal antibody 1024 
5 appeared less affected by this substitution than the 
other monoclonal antibodies (Table 5) . Thus, these 
studies demonstrated that K4J9 and K 432 were critical for 
the binding of all of the CD4 blocking monoclonal 
antibodies, and that K4,,, K4J,, and K440 did not appear to 
10 play a role in monoclonal antibody binding. 

Itoino acids recognized monoclonal antibodies that 
block binding of IIlB-rgpl20 to CD4. The 

identification of residues 429 and 432 as being part of 

15 the epitope recognized by the MN-rgpl20 specific CD4 
blocking monoclonal antibodies was particularly 
interesting since this region was previously found to 
be implicated in the binding of the 5C2 monoclonal 
antibody (21) . The properties of the 1024 like- 

20 monoclonal antibodies and the 5C2 monoclonal antibody 
differed from the C4 reactive monoclonal antibodies 
described by other investigators (4, 43) in that the 
former appeared strain specific and the latter were 
broadly cross reactive. To account for the strain 

25 specificity of these monoclonal antibodies, the 
sequence of the eleven amino acid peptide of 
IIIB-rgpl20 recognized by monoclonal antibody 5C2 was 
compared to the corresponding sequence of MN-rgpl20. 
It was found that the IIIB protein differed from the 

30 MNB protein at positions 429 where K replaced E and at 
position 423 where I replaced F (Figure 5). Because it 
was known from previous studies (33) that the 5C2 
monoclonal antibody was unable to bind to gpl20 from 
two strains (i.e., NY-5 and JRcsf) that also possessed 

35 E at position 423, it seemed unlikely that this 

position could account for the strain specificity of 
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5C2. Sec[uence comparison (Figure 5) also shoved that 
gpl20 from HIV-lnm unique in that a phenylalanine 
residue occvirred at position 423 whereas the other six 
strains examined possess an I at this position. 
5 To determine whether residues 423 and/or 429 could 

account for the type specificity of the 5C2 monoclonal 
antibody, a mutant of MN-rgpl20 was constructed which 
incorporated an F for I replacement at position 423 
(MN.423F). In addition, the MN-rgpl20 mutant, MN.429E 

10 (described above) was further mutagenized to 

incorporate a F for I substitution at position 423 
(MN.423F) , thus resulting in a double mutant 
(MN.423F,429E) whose sequence was identical to that of 
IIIB-rgpl20 within the 10 amino acid 5C2 epitope 

15 (Figure 4) . The expression of these mutants in 293s 
cells was verified by radioimmunoprecipitation using 
rabbit polyclonal antisera to MN-rgpl20. When the 
binding of the 13H8 monoclonal antibody to a set of 
mutants incorporating substitutions at position 423 and 

20 429 was examined, it was found that none of the 

replacements effected the binding of this antibody 
(data not shown) . When the 5C2 monoclonal antibody was 
examined, it was found that the F for I replacement 
(MN.423 F) conferred partial reactivity (Table 5). 

25 When the double mutant (MN.423F, 429E) , containing the F 
for I substitution as well as the E for K substitution 
was tested, binding that was indistinguishable from 
that to IIIB-rgpl20 was observed (Table 5) . These 
results demonstrated that F at position 423 and E at 

30 position 429 both play a role in binding of the 5C2 
monoclonal antibody, and suggest that the strain 
specificity of 5C2 can be attributed to the residues at 
these positions. 

Examination of the sequences of gpl20 from the 

35 various clones of LAI that have been analyzed revealed 
that several substrains of LAI differed from each other 
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in the C4 domain. Thus the sequences of the IIIB (30) , 
Bru (46) , and HXB3 (6) clones of LAI were identical at 
positions 423 and 429 where F and E residues occurred 
respectively. However, -the sequence of the HXB2 
5 substrain (36) differed from the others at these 

positions where, like MN-rgpl20, K replaced E and at 
position 423 where I replaced F (Figure 5) • Similarly, 
the HXIO and BHIO substrains (36, 37) differed only at 
position 423 where, like KlV-l^, I replaced F. Based 

10 on the mutagenesis experiments above, it would be 

predicted that monoclonal antibody 1024 should be able 
to bind to gpl20 from the HXB2 substrain of LAI, but 
not the HXB3 substrain. If I423 was important for 
binding, then 1024 should also bind the HXIO substrain. 

15 To test this hypothesis, the binding of monoclonal 

antibody 1024 to the surface cells infected with either 
IIIB, HXB2, HXB3, and HXIO substrains of HlV-l^^was 
measured by flow cytometry. It was found that 
monoclonal antibody 1024 was able to bind only HXB2 

20 providing further confirmation that residues 423 and 
429 were important for the binding of this antibody. 
The fact that monoclonal antibody 1024 did not bind to 
HXIO infected cells suggested that I423 was not important 
for the binding of this monoclonal antibody. Thus 

25 these studies demonstrate that reactivity with the 1024 
monoclonal antibody segregates with the occurrence of F 
and E residues at positions 423 and 429, respectively, 
and shows that substrains of HIV-I^ai differ from one 
another at a functionally significant epitope in the 04 

30 domain. 

Keutralizing activity of CD4 bloc]cing antibodies 
correlates with their binding affinity. To account for 
the difference in virus neutralizing activity between 
35 the CD4 blocking monoclonal antibodies, their gpl20 
binding affinities were determined by competitive 
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binding of [^^Sjj ^labeled monoclonal antibody to rgpl20 
(Table 6) . Typical Scatchard analysis of data from 
these assays is shown in Figure 7 (A to C) . Linear, 
one-site binding kinetics were observed for all the 
5 monoclonal antibodies to MN-rgpl20, suggesting that 
only a single class of sites was recognized, and that 
there was no cooperativity between two combining sites 
of each immunoglobulin molecule. It was found 
(Figxire 7A, Table 6) that monoclonal antibody 1024, 

10 which exhibited the most potent virus neutralizing 

activity (IC5oOf 0*08 /ig per ml), possessed the lowest 
Kj (2.7 nM) - In contrast (Figure 7C, Table 6), 
monoclonal antibody 1112, the antibody that exhibited 
the weakest virus neutralizing activity (IC50 of 30 fig 

15 per ml) possessed the highest (20 nM) . K^s for six 
additional CD4 -blocking monoclonal antibodies raised 
against MN-rgpl20 were also determined (Table 6) . It 
was found that monoclonal antibodies that possessed 
intermediate K^s similarly possessed intermediate 

20 neutralization IC50 values ♦ To explore the relationship 
between virus neutralizing activity and gpl20 binding 
affinity, the data in Table 6 was plotted in several 
different ways. It was found that when the of the 
monoclonal antibodies was plotted as a function of the 

25 log of the IC50, a linear relationship was obtained 
(Figure 8) . Using this analysis a correlation 
coefficient (r) of 0.97) was obtained. Thus, this 
graph demonstrates that the virus neutralizing activity 
of these monoclonal antibodies is directly proportional 

30 to the gpl20 binding affinity, and that the threshold 
for neutralization at this epitope is defined by the 
slope of the graph in Figure 8. 

A similar analysis was performed with the non- 
neutralizing CD4 blocking monoclonal antibodies to 

35 IIIB-rgpl20, 5C2 and 13H8- The binding curve for 13H8 
(Figure 7C) showed that it bound to a single class of 
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sites on IIIB-rgpl20 with a of 22 nM. The affinity 
of 5C2 could not be determined by this assay because at 
antibody concentrations greater than 5 nM, non-linear 
(reduced gpl20 binding) was observed. This effect was 
5 suggestive steric hindrance at these concentrations or 
negative cooperativity between combining sites. The 
binding affinity was also determined for the non- 
neutralizing, non-CD4 blocking monoclonal antibody to 
MN-rgpl20, 1086. The fact that this antibody exhibited 
10 a binding affinity similar (9.7 nM) to many of the 
neutralizing monoclonal antibodies but failed to 
inhibit infectivity, proves that high antibody binding 
affinity alone is not sufficient for neutralization. 

15 Effect of C4 Domain Mutants on CD4 binding. 

Finally, the CD4 binding properties of the series of 
MN-rgpl20 mutants, constructed to localize the C4 
domain epitopes, were measiired in a qualitative co- 
immunoprecipitation assay. In these studies the 

20 ability of the mutagenized MN-rgpl20 variants to co- 
immunoprecipitate CD4 was evaluated as described 
previously (21) in a qualitative co-immunoprecipitation 
assay similar to that described previously (19) . 
Briefly, 293 cells, transfected with plasmids directing 

25 the expression of MN-rgpl20 variants described in 
Figure 5, were metabolically labeled with 
[^^S] -methionine, and the growth conditioned cell 
culture supernatants were incubated with rsCD4. The 
resulting rsCD4:gpl20 complexes were then 

30 immunoprecipitated by addition of the CD4 specific 
monoclonal antibody, 465 (A) or a positive control 
monoclonal antibody (1034) directed to the V3 domain of 
MN-rgpl20 (B) . The immunoprecipitated proteins were 
resolved by SDS-PAGE and visualized by autoradiography 

35 as described previously (3). The samples were: Lane 
1, MN.419A; lane 2, MN.421A; lane 3, MN.429E; lane 4, 
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MN.429A; lane 5, MN.432A; lane 6, MN.440A; lane 7, 
MN-rgpl20. The gel show d that the mutants that block 
antibody binding do not block binding of CD4. 
Therefore, the antibodies do not bind to the gpl20 CD4- 
5 binding contact residues. This indicates that steric 
hinderance may inhibit antibody binding, rather than 
that the antibodies bind directly to the CD4 contact 
residues to inhibit binding. 

It was found that all of the variants in which 

10 apolar A residue was substituted for the charged K or E 
residues (e.g., MN.419A, MN.421A, MN.432A, and MN.440A) 
were still able to co-immunoprecipitate rsCD4. 
Similarly, the replacement of E for K at position 429 
(MN.429E), the replacement of F for I at position 423 

15 (MN.423F) or the mutant which incorporated both 

mutation (MN.423F,429E) also showed no reduction in 
their ability to co-immunoprecipitate rsCD4. Thus, 
radical amino acid sxibstitutions at five positions 
failed to affect the binding of gpl20 to CD4. These 

20 results were consistent with previous studies (5, 21, 
34) where it was found that only a few of the many 
mutations that have been induced in this region 
effected CD4 binding. 

This study indicates that neutralizing epitopes in 

25 the C4 domain have now been found to be located between 
about residues 420 and 440. In addition, the critical 
residues for antibody binding are residues 429 and 432. 

EXAMPLE 2 

30 Identification of V2 Neutralizing Epitopes 

The procedures described in Example 1 were used to 
map epitopes in the V2 region of gpl20. Table 7 
illustrates the results of mutagenicity studies to map 
V2 neutralizing epitopes. In the table, the coltamns 

35 indicate the comparison of binding of the monoclonal 
antibodies with wild type (WT) gpl20 in comparison to 
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10 



various mutations of 9pl20 using standard notation. 
For example, "G171R" indicates that the glycine (G) at 
residue 171 has been replaced by an arginine (R) . 
"172 A/ 173 A" indicates that the residues at 172 and 173 
have been replaced by alanine. The neutralizing 
monoclonal antibodies tested (MAbs) are listed in the 
rows. The numerical values in the table are the 
optical density value of an ELISA assay performed as 
described in Example 1 to measure the amount of 
antibody binding. The underlined values indicate 
significantly reduced binding, indicating the 
substituted residue is critical for binding of the 
antibody. 



15 



TABLE 7 



WT G171R, 172A/ E187V 187V/188S 

M174V 173A 



MAbs 





6E10 


1.00 


0.10 


1.28 


0.60 


0.?5 




1017 


1.00 


0.70 


1.10 


0.87 


0.04 


20 


1022 


1.00 


0.80 


1.10 


1.00 


0.00 




1028 


1.00 


0.90 


1.18 


1.07 


0.04 




1029 


1.00 


0.83 


1.16 


1.01 


0.16 




1019 


1.00 


0^13 


1.30 


0.75 


0.74 




1027 


1.00 


o>oo 


1.20 


0.80 


0.64 


25 


1025 


1.00 


0.69 


0.00 


0.00 


0.83 




1088 


1.00 


0.73 


1.12 


0.94 


0.03 




13H8 


1.00 


0.77 


0.78 


0.48 


0.65 
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TABLE 7 (continued) 





WT 


X77A 


X / ^A/ X / JA 


XOOA 




MAPS 














1 nn 


n 56 


0. 52 


0 • 64 


0.43 


"1 A1 "T 
XOX / 


X • uu 


n 77 


0 77 


0.76 


0. 11 




X • uu 


n R6 


0 . 72 


0 . 14 


0. 00 


1 rt*> Q 
o 
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0.41 
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1027 


1.00 


0.00 


0.02 


0.41 


0.49 


1025 


1.00 


0-75 


0.0 


0.83 


0.72 


1088 


1.00 


0.77 


0.77 


0.53 


0.00 


13H8 


1.00 


0.72 


0.72 


0.53 


0.60 



15 As illustrated in Table 7, the study demonstrated 

that there are a series of overlapping neutralizing 
epitopes from been found to be located in the V2 region 
(residues 163 through 200) , with most of the epitopes 
located between residues 163 and 200. In addition, the 

20 study indicates that the critical residues in the V2 

domain for antibody binding are residues 171, 173, 174, 
177, 181, 183, 187, and 188. 

EXAMPLE 3 

25 Jiiunujii nation Studies 

gpl20 from the MN, GNEg, and GNE,^ strains of HIV 
was prepared by amplifying the gene from each isolate 
and cloning and expressing the gene in CHO cells as 
described in Berman et al., J. Virol. 66:4464-4469 

30 (1992). Briefly, the gpl60 gene was amplified with two 
r xinds of amplification using the following nested 



-70- 



wo 94/28929 



PCT/DS94/06036 



primers according to the protocol by Kellog et al-, pp 
337-347 in PCR Protocols: a guide to m thods and 
amplification. Innis et al. (eds.) Academic Press, 
Inc. , New York. 
5 First round primers: 

AATAATAGCAATAGTTGTGTGGWCC (W is A or T) 
ATTCTTTCCCTTAYAGTAGGCCATCC (Y is T or C) 
Second round primers: 

GGGAATTCGGATCCAGAGCAGAAGACAGTGGCAATGA 

1 0 GTCAAGAATTCTTATAGCAAAGCCCTTTCCAA 

The primers are SEQ. ID. NOs. 31-34. Each gene is then 
digested with the restriction endonucleases Kpnl and 
Accl. The resulting fragment was subcloned into the 
Bluescript (+) phagemid M13 vector (Stratagene, Inc.) 

15 and sequenced by the dideoxynucleotide method (Sanger 
et al., Proc. Natl. Acad. Sci. USA 74:5463-5467 
(1977)). 

A fragment of the gpl20 coding region was then 
used to construct a chimeric gene for expression in 

20 mammalian cells, as described in Lasky et al.. Science 
223:209-212 (1986). The 5» end was fused to a 
polylinker adjacent to a simian virus 40 (SV40) 
promoter and the 3 ' end was fused to a polylinker 
adjacent to the 3* untranslated sequences containing an 

25 SV40 polyadenylation signal. The expression vector 

(MN-rgpl20) was co-transf ected in CHO cells deficient 
in production of the enzyme dihydrofolate reductase, 
along with a plasmid (pSVdhfr) containing a cDNA 
encoding the selectable marker, dihydrofolate 

30 reductase. Cell lines expressing MN-rgpl20 were 
isolated as described in Lasky et al.. Science 
223:209-212 (1986). The recombinant glycoprotein was 
purified from growth-conditioned cell culture medium by 
immunoaf finity and ion exchange chromatography as 

35 described in Leonard et al., J. Siol. Chem. 265:10373- 
10382 (1990) . 
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gpl20 from the GNEg and GNE,^ strains of HIV is 
prepared in the same manner as described for the MN 
isolate. 

MN-rgpl20 (300 Mg/injection) , GNEj-rgpl20 (300 /xg/ 
5 injection), and GNE,4-rgpl20 (300 Mg/ injection) are 
prepared in an aluminum hydroxide adjuvant (as 
described in Cordonnier et al.. Nature 340:571-574 
(1989)). Six chimpanzees are injected at 0, 4, and 32 
weeks. Sera are collected and assayed for neutralizing 

10 antibody to each strain of HIV at the time of each 

immunization and three weeks thereafter. At 35 weeks, 
each of the chimpanzees has significant levels of 
neutralizing antibodies to each strain. 

At 35 weeks, the chimpanzees are randomly assigned 

15 to three groups. Each group is challenged with about 
10 50% chimpanzee-infectious doses (CID50) each of one 
of the vaccine isolates. One unimmunized chimpanzee 
(control) is also injected with the same amount of 
virus as the immunized chimpanzees for each vaccine 

20 strain. 

Sera are drawn every two weeks throughout the 
study and assayed for antibodies to HIV core proteins 
and for the presence of HIV by PGR amplification and 
co-cultivation of peripheral blood mononuclear cells 

25 (PBMCs) from the chimpanzee together with activated 

hiiman or chimpanzee PBMCs. The presence of antibodies 
to core proteins indicates the presence of viral 
infection as does the detection of amplified viral DNA 
or viral infection of co-cultivated cells. 

30 The presence of virus is detected by PGR and 

co-cultivation methods in each unimmunized control 
animal between weeks 2 and 4 post challenge. 
Antibodies to core proteins appear in the control 
chimpanzees at six weeks post challenge. Neither virus 

35 nor antibodies are at detectable levels in any of the 
immunized chimpanzees at one year post challenge, 
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indicating that the vaccine effectively protects the 
chimpanzees from infection from each of the challenge 
strains. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
5 

(i) APPLICANT: Herman, Phillip W. 

Nakainura, Gerald R« 

(ii) TITLE OF INVENTION: HIV Envelope Polypeptides 

10 

(iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Skjerven, Morrill, MacPherson, Franklin & 
15 Friel 

(B) STREET: 25 Metro Drive Suite 700 

(C) CITY: San Jose 

(D) STATE: California 

(E) COUNTRY: U.S.A. 
20 (F) ZIP: 95110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
30 (B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: Terlizzi, Laura 
35 (B) REGISTRATION NUMBER: 31,307 

(C) REFERENCE/ DOCKET NUMBER: M-2820-1F 

(ix) TELECOMMUNICATION INFORMATION:' 
(A) TELEPHONE: (408) 283-1222 
40 (B) TELEFAX: (408) 283-1233 



(2) INFORMATION FOR SEQ ID N0:1: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
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Met Arg Val Lys Gly lie Arg Arg Asn Tyr Gin His Trp Trp Gly Arg 
15 10 15 

Gly Thr Met Leu Leu Gly Leu Leu Met lie Cys Ser Ala Thr Glu Lys 
5 20 25 30 

Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 
35 40 45 

10 Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Ala 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

15 

Gin Glu Val Glu Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Glu Gin Met His Glu Asp He He Ser Leu Trp Asp 
20 100 105 110 

Gin Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

25 Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn Thr Asn Asn Ser Thr Asp 
130 135 140 

Asn Asn Asn Ser Lys Ser Glu Gly Thr He Lys Gly Gly Glu Met Lys 
145 150 155 160 

30 

Asn Cys Ser Phe Asn He Thr Thr Ser He Gly Asp Lys Met Gin Lys 
165 170 175 

Glu Tyr Ala Leu Leu Tyr Lys Leu Asp He Glu Pro He Asp Asn Asp 
35 180 185 190 

Ser Thr Ser Tyr Arg Leu He Ser Cys Asn Thr Ser Val He Thr Gin 
195 200 205 

40 Ala Cys Pro Lys He Ser Phe Glu Pro He Pro He His Tyr Cys Ala 
210 215 220 

Pro Ala Gly Phe Ala He Leu Lys Cys Asn Asp Lys Lys Phe Ser Gly 
225 230 235 240 

45 

Lys Gly Ser Cys Lys Asn Val Ser Thr Val Gin Cys Thr His Gly He 
245 250 255 

Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu 
50 260 265 270 

Glu Glu Val Val He Arg Ser Glu Asp Phe Thr Asp Asn Ala Lys Thr 
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275 280 285 

He He Val His I*eu Lys Glu Ser Val Gin He Asn Cys Thr Arg Pro 
290 295 300 

5 

Asn Tyr Asn Lys Arg Lys Arg He His He Gly Pro Gly Arg Ala Phe 
305 310 315 320 

Tyr Thr Thr Lys Asn He Lys Gly Thr He Arg Gin Ala His Cys He 
10 325 330 335 

He Ser Arg Ala Lys Trp Asn Asp Thr Leu Arg Gin He Val Ser Lys 
340 345 350 

15 Leu Lys Glu Gin Phe Lys Asn Lys Thr He Val Phe Asn Pro Ser Ser 
355 360 365 

Gly Gly Asp Pro Glu He Val Met His Ser Phe Asn Cys Gly Gly Glu 
370 375 380 

20 

Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser He Trp Asn Gly 
385 390 395 400 

Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn He Thr Leu 
25 405 410 415 

Gin Cys Lys He Lys Gin He He Asn Met Trp Gin Lys Val Gly Lys 
420 425 430 

30 Ala Met Tyr Ala Pro Pro He Glu Gly Gin He Arg Cys Ser Ser Asn 
435 440 445 

He Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Asp Thr Asp Thr 
450 455 460 

35 

Asn Asp Thr Glu He Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn 
465 470 475 480 

Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Thr He Glu Pro Leu 
40 485 490 495 

Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val Val Gin Arg Glu 
500 505 510 

(ajB INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 



81 



W0 5W/28929 



PCT/US94/06036 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Tyr Ala Leu Ala Asp Ala Ser Leu Lys Met Ala Asp Pro Asn Arg 
15 10 15 

5 

Phe Arg Gly Lys Asp Leu Pro Val Leu Asp Gin Leu Leu Glu Val Pro 
20 25 30 

Val Trp Lys Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys 
10 35 40 45 

Ala Tyr Asp Thr Glu Ala His Asn Val Trp Ala Thr His Ala Cys Val 
50 55 60 

15 Pro Thr Asp Pro Asn Pro Gin Glu Val Glu Leu Val Asn Val Thr Glu 
65 70 75 80 

Asn Phe Asn Met Trp Lys Asn Asn Met Val Glu Gin Met His Glu Asp 
85 90 95 

20 

lie lie Ser Leu Trp Asp Gin Ser Leu Lys Pro Cys Val Lys Leu Thr 
100 105 110 

Pro Leu Cys Val Thr Leu Asn Cys Thr Asp Leu Arg Asn Thr Thr Asn 
25 115 120 125 

Thr Asn Asn Ser Thr Asp Asn Asn Asn Ser Lys Ser Glu Gly Thr lie 
130 135 140 

30 Lys Gly Gly Glu Met Lys Asn Cys Ser Phe Asn lie Thr Thr Ser lie 
145 150 155 160 

Gly Asp Lys Met Gin Lys Glu Tyr Ala Leu Leu Tyr Lys Leu Asp lie 
165 170 175 

35 

Glu Pro lie Asp Asn Asp Ser Thr Ser Tyr Arg Leu lie Ser Cys Asn 
180 185 190 

Thr Ser Val lie Thr Gin Ala Cys Pro Lys lie Ser Phe Glu Pro lie 
40 195 200 205 

Pro lie His Tyr Cys Ala Pro Ala Gly Phe Ala lie Leu Lys Cys Asn 
210 215 220 

4 5 Asp Lys Lys Phe Ser Gly Lys Gly Ser Cys Lys Asn Val Ser Thr Val 
225 230 235 240 

Gin Cys Thr His Gly He Arg Pro Val Val Ser Thr Gin Leu Leu Leu 
245 250 255 

50 

Asn Gly Ser Leu Ala Glu Glu Glu Val Val He Arg Ser Glu Asp Phe 
260 265 270 
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Thr Asp Asn Ala Lys Thr lie lie Val His Leu Lys Glu Ser Val Gin 
275 280 285 

He Asn Cys Thr Arg Pro Asn Tyr Asn Lys Arg Lys Arg He His He 
5 290 295 300 

Gly Pro Gly Arg Ala Phe Tyr Thr Thr Lys Asn He Lys Gly Thr He 
305 310 315 320 

10 Arg Gin Ala His Cys He He Ser Arg Ala Lys Trp Asn Asp Thr Leu 

325 330 335 

Arg Gin He Val Ser Lys Leu Lys Glu Gin Phe Lys Asn Lys Thr He 
340 345 350 

15 

Val Phe Asn Pro Ser Ser Gly Gly Asp Pro Glu He Val Met Hxs Ser 
355 360 365 

Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe 
20 370 375 380 

Asn Ser He Trp Asn Gly Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser 
385 390 395 400 

25 Asn Asn Asn He Thr Leu Gin Cys Lys He Lys Gin He He Asn Met 

405 410 415 

Trp Gin Lys Val Gly Lys Ala Met Tyr Ala Pro Pro He Glu Gly Gin 
420 425 430 

30 

He Arg Cys Ser Ser Asn He Thr Gly Leu Leu Leu Thr Arg Asp Gly 
435 440 445 

Gly Glu Asp Thr Asp Thr Asn Asp Thr Glu He Phe Arg Pro Gly Gly 
35 450 455 460 

Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val 
465 470 475 480 

40 Val Thr He Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg 

485 490 495 

Val Val Gin- Arg Glu 
500 

45 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Lys He Lys Gin He He Asn Met Trp Gin Lys Val Gly Lys Ala 
5 1 5 10 1^ 

Met Tyr Ala Pro Pro He Glu Gly Gin He Arg Cys 
20 25 

(2P INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

20 

CVS Arq He Lys Gin Phe He Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

Met Tyr Ala Pro Pro He Ser Gly Gin He Arg Cys 
25 20 25 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Cys Arg He Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

40 

Met Tyr Ala Pro Pro He Lys Gly Gin He Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 6: 
45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Gly Val Gly Lys Ala 
15 10 15 

5 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Asn Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 7: 
10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

20 Cys Arg lie Lys Gin lie lie Asn Arg Trp Gin Glu Val Gly Lys Ala 
1 5 • 10 15 

lie Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Cys Arg lie Lys Gin lie Val Asn Met Trp Gin Arg Val Gly Gin Ala 
15 10 15 

4 0 Met Tyr Ala Pro Pro lie Lys Gly Val lie Lys Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 9: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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Cys Lys lie Lys Gin He He Asn Met Trp Gin Gly Ala Gly Gin Ala 
1 5 10 15 

Met Tyr Ala Pro Pro He Ser Gly Thr He Asn Cys 
5 20 25 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Cys Arg He Lys Gin Phe He Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

20 

Met Tyr Ala Pro Pro He Ser Gly Gin He Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 11: 
25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

35 Cys Lys He Lys Gin He He Asn Met Trp Gin Lys Val Gly Lys Ala 
15 10 15 

Met Tyr Ala Pro Pro He Ser Gly Gin He Arg Cys 
20 25 

40 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Cys Arg He Lys Gin He He Asn Met Trp Gin Glu Val Gly Lys Ala 
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10 15 



Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

20 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 14: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Ser Gly Gly Asp Pro Glu lie Val Met His Ser Phe Asn Cys Gly Gly 
35 1 5 10 15 

Glu Phe Phe Tyr Cys Asn Thr Ser Pro Leu Phe Asn Ser lie Trp Asn 
20 25 30 

40 Gly Asn Asn Thr Trp Asn Asn Thr Thr Gly Ser Asn Asn Asn lie Thr 
35 40 45 

Leu Gin Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly 
50 55 60 

45 

Lys Ala Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys Ser Ser 
65 70 75 80 

Asn lie Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly 
50 85 90 

(2) INFORMATION FOR SEQ ID NO: 15: 

87 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

10 Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

15 

(2) INFORMATION FOR SEQ . ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Ala Val Gly Lys Ala 
15 10 15 

30 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 17: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Cys Ala lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
45 1 5 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(ap INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Cys Lys lie Ala Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
10 1 5 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(2f INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

25 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Ala Ala 
15 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
30 20 25 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Cys Lys lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
1 5 10 15 

45 

Met Tyr Ala Pro Pro He Ala Gly Gin He Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 21: 
50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Cys Arg lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 
15 10 15 

10 

Met Tyr Ala Pro Pro lie Ser Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 22: 
15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

25 Cys Lys lie Lys Gin Phe lie Asn Met Trp Gin Lys Val Gly Lys Ala 
15 10 15 

Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

30 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 



40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Cys Lys lie Lys Gin Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala 
1 5 10 15 

45 Met Tyr Ala Pro Pro lie Glu Gly Gin lie Arg Cys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 24: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
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5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24; 

Phe lie Asn Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro Pro 
1 5 10 15 

10 He Ser 



(2) INFORMATION FOR SEQ ID NO: 25: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Trp Gin Glu Val Gly Lys Ala Met Tyr Ala Pro 
25 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Lys Ala Met Tyr Ala Pro Pro He Lys Gly Gin He Arg 
15 10 

40 
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WHAT IS CLAIMED IS: 



1. A method for making an HIV gpl20 subunit vaccine 
for a geographic region comprising the steps of: 

5 a. determining a neutralizing epitope in the V2 

or C4 domain of gpl20 of HIV isolates from 
the geographic region; 

b. selecting an HIV strain having gpl20 which 
has a neutralizing epitope in the V2 or C4 

10 domain which is common among isolates in the 

geographic region; and 

c. making an HIV gpl20 subunit vaccine from the 
selected isolate. 

2. The method of Claim 1 wherein the neutralizing 
15 epitope is determined by determining the amino 

acid sequence for at least a portion of the V2 or 
C4 domain. 

3* The method of Claim 2 wherein the amino acid 

sequence is determined by setjuencing DNA encoding 
20 at least a portion of the V2 or C4 domain. 

4. The method of Claim 3 wherein a plurality of 
isolates having different amino acid sequences for 
the V2 and C4 domains are selected. 

5. The method of Claim 4 wherein a plurality of 

25 isolates having different amino acid sequences for 

the V3 domain is selected. 

6. A method for making an HIV gpl20 subunit vaccine 
for a geographic region comprising the steps of: 

a. determining neutralizing epitopes for the V2, 
30 V3, and C4 domains of gpl20 from HIV isolates 

from the geographic region; 

b. selecting at least two HIV isolates having 
different neutralizing epitopes in the V2, 
V3, or C4 domain; and 

35 c- making an HIV gpl20 subunit vaccine from the 

selected isolates • 



-92- 



wo 94/28929 

7. 
8. 

5 
10 
15 

9. 
10. 

20 

11. 

12. 
25 13. 

14. 

30 

15. 

35 



PCT/US94/06036 



The method of Claim 6 wherein each of the selected 
isolates have one of the most common neutralizing 
epitopes. 

A method for making an HIV gpl20 subunit vaccine 
for a geographic region comprising the steps of: 

a. determining the neutralizing epitopes for HIV 
isolates and the percentage of HIV infections 
attributable to each strain present in the 
region ; 

b. selecting at least two HIV strains which have 
the most common neutralizing epitopes in the 
V2, V3, and C4 domains in the geographic 
region; and 

c. making an HIV gpl20 subunit vaccine from the 
selected isolates. 

The method of Claim 8 wherein the isolates are 
primary patient isolates. 

The method of Claim 8 wherein the geographic 
region is North America and the amino acid 
sequence of gpl20 from the HIV isolates MN and GNEg 
are selected. 

The method of Claim 10 wherein the GNEi^ isolate is 
also selected. 

A multivalent HIV gpl20 subunit vaccine. 
The vaccine of Claim 12 wherein gpl20 present in 
the vaccine is from at least two HIV isolates 
which have a different neutralizing epitope in the 
V2 or C4 domain of gpl20. 

The vaccine of Claim 12 wherein gpl20 present in 
the vaccine is from at least two HIV isolates 
which have a different neutralizing epitope in the 
V3 domain of gpl20. 

The vaccine of Claim 12 wherein each isolate has a 
different common neutralizing epitope for the V2 
or C4 domains of gpl20. 
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The vaccine of Claim 1 wherein gpl20 present in 
the vaccine is from the MN and GNEg strains of HIV. 
The vaccine of Claim 1 wherein gpl20 from the GN£]^ 
strain of HIV is also present in the vaccine. 
A DNA sequence of less than 5 kilobases encoding 
gpl20 from GNEg and having the nucleotide sequence 
illustrated in Table 1. 

A DNA sequence of less than 5 kilobases encoding 
gpl20 from GNE15 and having the nucleotide sequence 
illustrated in Table 2. 

An expression construct comprising DNA encoding 
gpl20 selected from the group consisting of 
GNE8-gpl20 and GNEi4-gpl20 under the transcriptional 
and trans lational control of a heterologous 
promoter . 

The expression construct of Claim 20 wherein the 
promoter is a eukaryotic promoter. 
The expression construct of Claim 21 wherein the 
DNA encoding gpl20 is joined to a heterologous 
signal sequence. 

An isolated GNEg-gpl20 polypeptide having the amino 

acid sequence illustrated in Table 1. 

An isolated GNE,5-gpl2p polypeptide having the 

amino acid sequence illustrated in Table 2. 

An improved serotyping method for HIV strains 

comprising determining the serotypes of the V2, 

V3, and C4 domains of gpl20. 

A truncated gpl20 sequence which sequence is free 
from the C5 domain. 

The truncated gpl20 sequence of Claim 26 wherein 
the sequence is additionally free from the carboxy 
terminus through amino acid residue 453 of the 
gpl20 V5 domain. 

The truncated gpl20 sequence of Claim 27 wherein 
the sequence is additionally free from the gpl20 
V5 domain. 
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29. The truncated gpl20 sequence of Claim 26 wherein 
the sequence is additionally free fr m the gpl20 
signal sequence. 

30. The truncated gpl20 sequence of Claim 29 wherein 

5 the sequence is additionally free from the carboxy 

terminus through amino acid residue 111 of the 
gpl20 CI domain. 

31. The truncated gpl20 sequence of Claim 29 wherein 
the sequence is additionally free from the carboxy 

10 terminus through amino acid residue 117 of the 

gpl20 CI domain. 

32. The truncated gpl20 sequence of Claim 26 wherein 
the sequence is free from the amino terminus of 
gpl20 through residue 111 of the CI domain and 

15 residue 453 through the carboxy terminus of gpl20. 

33. The truncated gpl20 sequence of Claim 26 wherein 
the sequence is produced by recombinant 
engineering. 
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